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Foreword 
by Lev Beklemishev, Moscow 


The field of mathematical logic—evolving around the notions of logical 
validity, provability, and computation—was created in the first half of the 
previous century by a cohort of brilliant mathematicians and philosophers 
such as Frege, Hilbert, Godel, Turing, Tarski, Malcev, Gentzen, and some 
others. The development of this discipline is arguably among the highest 
achievements of science in the twentieth century: it expanded mathe- 
matics into a novel area of applications, subjected logical reasoning and 
computability to rigorous analysis, and eventually led to the creation of 
computers. 

The textbook by Professor Wolfgang Rautenberg is a well-written in- 
troduction to this beautiful and coherent subject. It contains classical 
material such as logical calculi, beginnings of model theory, and Gédel’s 
incompleteness theorems, as well as some topics motivated by applica- 
tions, such as a chapter on logic programming. The author has taken 
great care to make the exposition readable and concise; each section is 
accompanied by a good selection of exercises. 

A special word of praise is due for the author’s presentation of Gédel’s 
second incompleteness theorem, in which the author has succeeded in 
giving an accurate and simple proof of the derivability conditions and 
the provable %4-completeness, a technically difficult point that is usually 
omitted in textbooks of comparable level. This work can be recommended 
to all students who want to learn the foundations of mathematical logic. 


Preface 


The third edition differs from the second mainly in that parts of the 
text have been elaborated upon in more detail. Moreover, some new 
sections have been added, for instance a separate section on Horn formulas 
in Chapter 4, particularly interesting for logic programming. The book 
is aimed at students of mathematics, computer science, and linguistics. 
It may also be of interest to students of philosophy (with an adequate 
mathematical background) because of the epistemological applications of 
Gédel’s incompleteness theorems, which are discussed in detail. 

Although the book is primarily designed to accompany lectures on a 
graduate level, most of the first three chapters are also readable by under- 
graduates. The first hundred twenty pages cover sufficient material for an 
undergraduate course on mathematical logic, combined with a due por- 
tion of set theory. Only that part of set theory is included that is closely 
related to mathematical logic. Some sections of Chapter 3 are partly 
descriptive, providing a perspective on decision problems, on automated 
theorem proving, and on nonstandard models. 

Using this book for independent and individual study depends less on 
the reader’s mathematical background than on his (or her) ambition to 
master the technical details. Suitable examples accompany the theorems 
and new notions throughout. We always try to portray simple things 
simply and concisely and to avoid excessive notation, which could divert 
the reader’s mind from the essentials. Line breaks in formulas have been 
avoided. To aid the student, the indexes have been prepared very carefully. 
Solution hints to most exercises are provided in an extra file ready for 
download from Springer’s or the author’s website. 

Starting from Chapter 4, the demands on the reader begin to grow. The 
challenge can best be met by attempting to solve the exercises without 
recourse to the hints. The density of information in the text is rather high; 
a newcomer may need one hour for one page. Make sure to have paper and 
pencil at hand when reading the text. Apart from sufficient training in 
logical (or mathematical) deduction, additional prerequisites are assumed 
only for parts of Chapter 5, namely some knowledge of classical algebra, 
and at the very end of the last chapter some acquaintance with models of 
axiomatic set theory. 


Vil 


vili Preface 


On top of the material for a one-semester lecture course on mathemat- 
ical logic, basic material for a course in logic for computer scientists is 
included in Chapter 4 on logic programming. An effort has been made to 
capture some of the interesting aspects of this discipline’s logical founda- 
tions. The resolution theorem is proved constructively. Since all recursive 
functions are computable in PROLOG, it is not hard to deduce the un- 
decidability of the existence problem for successful resolutions. 

Chapter 5 concerns applications of mathematical logic in mathematics 
itself. It presents various methods of model construction and contains the 
basic material for an introductory course on model theory. It contains in 
particular a model-theoretic proof of quantifier eliminability in the theory 
of real closed fields, which has a broad range of applications. 

A special aspect of the book is the thorough treatment of Gédel’s incom- 
pleteness theorems in Chapters 6 and 7. Chapters 4 and 5 are not needed 
here. 6.1! starts with basic recursion theory needed for the arithmeti- 
zation of syntax in 6.2 as well as in solving questions about decidability 
and undecidability in 6.5. Defining formulas for arithmetical predicates 
are classified early, to elucidate the close relationship between logic and 
recursion theory. Along these lines, in 6.5 we obtain in one sweep Gédel’s 
first incompleteness theorem, the undecidability of the tautology problem 
by Church, and Tarski’s result on the nondefinability of truth, all of which 
are based on certain diagonalization arguments. 6.6 includes among other 
things a sketch of the solution to Hilbert’s tenth problem. 

Chapter 7 is devoted mainly to Gédel’s second incompleteness theo- 
rem and some of its generalizations. Of particular interest thereby is the 
fact that questions about self-referential arithmetical statements are al- 
gorithmically decidable due to Solovay’s completeness theorem. Here and 
elsewhere, Peano arithmetic (PA) plays a key role, a basic theory for the 
foundations of mathematics and computer science, introduced already in 
3.3. The chapter includes some of the latest results in the area of self- 
reference not yet covered by other textbooks. 

Remarks in small print refer occasionally to notions that are undefined 
and direct the reader to the bibliography, or will be introduced later. 
The bibliography can represent an incomplete selection only. It lists most 


' This is to mean Section 6.1, more precisely, Section 1 in Chapter 6. All other boldface 
labels are to be read accordingly throughout the book. 
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English textbooks on mathematical logic and, in addition, some original 
papers mainly for historical reasons. It also contains some titles treating 
biographical, historical, and philosophical aspects of mathematical logic 
in more detail than this can be done in the limited size of our book. Some 
brief historical remarks are also made in the Introduction. Bibliographical 
entries are sorted alphabetically by author names. This order may slightly 
diverge from the alphabetic order of their citation labels. 

The material contained in this book will remain with high probability 
the subject of lectures on mathematical logic in the future. Its streamlined 
presentation has allowed us to cover many different topics. Nonetheless, 
the book provides only a selection of results and can at most accentuate 
certain topics. This concerns above all Chapters 4, 5, 6, and 7, which go 
a step beyond the elementary. Philosophical and foundational problems 
of mathematics are not systematically discussed within the constraints of 
this book, but are to some extent considered when appropriate. 

The seven chapters of the book consist of numbered sections. A refer- 
ence like Theorem 5.4 is to mean Theorem 4 in Section 5 of a given chap- 
ter. In cross-referencing from another chapter, the chapter number will 
be adjoined. For instance, Theorem 6.5.4 means Theorem 5.4 in Chap- 
ter 6. You may find additional information about the book or contact 
me on my website www.math.fu-berlin.de/~raut. Please contact me if 
you propose improved solutions to the exercises, which may afterward be 
included in the separate file Solution Hints to the Exercises. 

I would like to thank the colleagues who offered me helpful criticism 
along the way. Useful for Chapter 7 were hints from Lev Beklemishev 
and Wilfried Buchholz. Thanks also to Peter Agricola for his help in 
parts of the contents and in technical matters, and to Michael Knoop, 
Emidio Barsanti, and David Kramer for their thorough reading of the 
manuscript and finding a number of mistakes. 


Wolfgang Rautenberg, June 2009 
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Introduction 


Traditional logic as a part of philosophy is one of the oldest scientific 
disciplines. It can be traced back to the Stoics and to Aristotle! and 
is the root of what is nowadays called philosophical logic. Mathematical 
logic, however, is a relatively young discipline, having arisen from the en- 
deavors of Peano, Frege, and Russell to reduce mathematics entirely to 
logic. It steadily developed during the twentieth century into a broad dis- 
cipline with several subareas and numerous applications in mathematics, 
computer science, linguistics, and philosophy. 

One feature of modern logic is a clear distinction between object lan- 
guage and metalanguage. The first is formalized or at least formalizable. 
The latter is, like the language of this book, a kind of a colloquial language 
that differs from author to author and depends also on the audience the 
author has in mind. It is mixed up with semiformal elements, most of 
which have their origin in set theory. The amount of set theory involved 
depends on one’s objectives. Traditional semantics and model theory as 
essential parts of mathematical logic use stronger set-theoretic tools than 
does proof theory. In some model-theoretic investigations these are often 
the strongest possible ones. But on average, little more is assumed than 
knowledge of the most common set-theoretic terminology, presented in 
almost every mathematical course or textbook for beginners. Much of it 
is used only as a facon de parler. 

The language of this book is similar to that common to almost all math- 
ematical disciplines. There is one essential difference though. In math- 
ematics, metalanguage and object language strongly interact with each 
other, and the latter is semiformalized in the best of cases. This method 
has proved successful. Separating object language and metalanguage is 
relevant only in special context, for example in axiomatic set theory, where 
formalization is needed to specify what certain axioms look like. Strictly 
formal languages are met more often in computer science. In analyzing 
complex software or a programming language, as in logic, formal linguistic 
entities are the central objects of consideration. 


' The Aristotelian syllogisms are easy but useful examples for inferences in a first-order 
language with unary predicate symbols. One of these syllogisms serves as an example 
in Section 4.6 on logic programming. 
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The way of arguing about formal languages and theories is traditionally 
called the metatheory. An important task of a metatheoretic analysis is 
to specify procedures of logical inference by so-called logical calculi, which 
operate purely syntactically. There are many different logical calculi. The 
choice may depend on the formalized language, on the logical basis, and 
on certain aims of the formalization. Basic metatheoretic tools are in any 
case the naive natural numbers and inductive proof procedures. We will 
sometimes call them proofs by metainduction, in particular when talking 
about formalized object theories that speak about natural numbers. In- 
duction can likewise be carried out on certain sets of strings over a fixed 
alphabet, or on the system of rules of a logical calculus. 


The logical means of the metatheory are sometimes allowed or even ex- 
plicitly required to be different from those of the object language. But in 
this book the logic of object languages, as well as that of the metalang- 
uage, are classical, two-valued logic. There are good reasons to argue that 
classical logic is the logic of common sense. Mathematicians, computer 
scientists, linguists, philosophers, physicists, and others are using it as a 
common platform for communication. 


It should be noticed that logic used in the sciences differs essentially 
from logic used in everyday language, where logic is more an art than a se- 
rious task of saying what follows from what. In everyday life, nearly every 
utterance depends on the context. In most cases logical relations are only 
alluded to and rarely explicitly expressed. Some basic assumptions of two- 
valued logic mostly fail, in particular, a context-free use of the logical con- 
nectives. Problems of this type are not dealt with here. To some extent, 
many-valued logic or Kripke semantics can help to clarify the situation, 
and sometimes intrinsic mathematical methods must be used in order to 
solve such problems. We shall use Kripke semantics here for a different 
goal, though, the analysis of self-referential sentences in Chapter 7. 


Let us add some historical remarks, which, of course, a newcomer may 
find easier to understand after and not before reading at least parts of this 
book. In the relatively short period of development of modern mathemat- 
ical logic in the twentieth century, some highlights may be distinguished, 
of which we mention just a few. Many details on this development can be 
found in the excellent biographies [Daw] and [FF] on Gédel and Tarski, 
the leading logicians in the last century. 
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The first was the axiomatization of set theory in various ways. The most 
important approaches are those of Zermelo (improved by Fraenkel and von 
Neumann) and the theory of types by Whitehead and Russell. The latter 
was to become the sole remnant of Frege’s attempt to reduce mathematics 
to logic. Instead it turned out that mathematics can be based entirely on 
set theory as a first-order theory. Actually, this became more salient after 
the rest of the hidden assumptions by Russell and others were removed 
from axiomatic set theory around 1915; see [Hei]. For instance, the notion 
of an ordered pair, crucial for reducing the notion of a function to set 
theory, is indeed a set-theoretic and not a logical one. 


Right after these axiomatizations were completed, Skolem discovered 
that there are countable models of the set-theoretic axioms, a drawback 
to the hope for an axiomatic characterization of a set. Just then, two 
distinguished mathematicians, Hilbert and Brouwer, entered the scene 
and started their famous quarrel on the foundations of mathematics. It 
is described in a comprehensive manner for instance in [K12, Chapter IV] 
and need therefore not be repeated here. 


As anext highlight, Gédel proved the completeness of Hilbert’s rules for 
predicate logic, presented in the first modern textbook on mathematical 
logic, [HA]. Thus, to some extent, a dream of Leibniz became real, namely 
to create an ars inveniendi for mathematical truth. Meanwhile, Hilbert 
had developed his view on a foundation of mathematics into a program. It 
aimed at proving the consistency of arithmetic and perhaps the whole of 
mathematics including its nonfinitistic set-theoretic methods by finitary 
means. But Gédel showed by his incompleteness theorems in 1931 that 
Hilbert’s original program fails or at least needs thorough revision. 


Many logicians consider these theorems to be the top highlights of math- 
ematical logic in the twentieth century. A consequence of these theorems 
is the existence of consistent extensions of Peano arithmetic in which true 
and false sentences live in peaceful coexistence with each other, called 
“dream theories” in 7.3. It is an intellectual adventure of holistic beauty 
to see wisdom from number theory known for ages, such as the Chinese re- 
mainder theorem, simple properties of prime numbers, and Euclid’s char- 
acterization of coprimeness (page 249), unexpectedly assuming pivotal 
positions within the architecture of Gédel’s proofs. Gédel’s methods were 
also basic for the creation of recursion theory around 1936. 
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Church’s proof of the undecidability of the tautology problem marks an- 
other distinctive achievement. After having collected sufficient evidence 
by his own investigations and by those of Turing, Kleene, and some oth- 
ers, Church formulated his famous thesis (see 6.1), although in 1936 no 
computers in the modern sense existed nor was it foreseeable that com- 
putability would ever play the basic role it does today. 

Another highlight of mathematical logic has its roots in the work of 
Tarski, who proved first the undefinability of truth in formalized languages 
as explained in 6.5, and soon thereafter started his fundamental work on 
decision problems in algebra and geometry and on model theory, which 
ties logic and mathematics closely together. See Chapter 5. 

As already mentioned, Hilbert’s program had to be revised. A decisive 
step was undertaken by Gentzen, considered to be another groundbreaking 
achievement of mathematical logic and the starting point of contemporary 
proof theory. The logical calculi in 1.4 and 3.1 are akin to Gentzen’s 
calculi of natural deduction. 

We further mention Gédel’s discovery that it is not the axiom of choice 
(AC) that creates the consistency problem in set theory. Set theory with 
AC and the continuum hypothesis (CH) is consistent, provided set theory 
without AC and CH is. This is a basic result of mathematical logic that 
would not have been obtained without the use of strictly formal methods. 
The same applies to the independence proof of AC and CH from the axioms 
of set theory by Cohen in 1963. 

The above indicates that mathematical logic is closely connected with 
the aim of giving mathematics a solid foundation. Nonetheless, we confine 
ourself to logic and its fascinating interaction with mathematics, which 
characterizes mathematical logic. History shows that it is impossible to 
establish a programmatic view on the foundations of mathematics that 
pleases everybody in the mathematical community. Mathematical logic 
is the right tool for treating the technical problems of the foundations of 
mathematics, but it cannot solve its epistemological problems. 


Notation 


We assume that the reader is familiar with the most basic mathematical 
terminology and notation, in particular with the union, intersection, and 
complementation of sets, denoted by U, M, and \, respectively. Here we 
summarize only some notation that may differ slightly from author to 
author or is specific for this book. N, Z, Q, R denote the sets of natural 
numbers including 0, integers, rational, and real numbers, respectively, 
and N,,Qi,R4 the sets of positive members of the corresponding sets. 
n,m,t, j,k always denote natural numbers unless stated otherwise. Hence, 
extended notation like n € N is mostly omitted. 

In the following, M,N denote sets, M C N denotes inclusion, while 
M CN means proper inclusion (i.e., MC N and M4 N). As arule, we 
write M CN only if the circumstance M 4 N has to be emphasized. If 
M is fixed in a consideration and N varies over subsets of M/, then M\ N 
may also be symbolized by \N or AN. 

() denotes the empty set, and ®M the power set (= set of all subsets) 
of M. If one wants to emphasize that all elements of a set S are sets, S' is 
also called a system or family of sets. (J S denotes the union of S$, that is, 
the set of elements belonging to at least one M € S, and (]S stands for 
the intersection of a nonempty system 5S, the set of elements belonging to 
all Me S. If S={M;|i € I} then US and (]S are mostly denoted by 
User Mi and (),<; Mi, respectively. 

A relation between M and N is asubset of M x N, the set of ordered 
pairs (a,b) with a € M and bE N. A precise definition of (a,b) is given 
on page 114. Such a relation, f say, is said to be a function or mapping 
from M to N if for each a € M there is precisely one b € N with (a,b) € f. 
This b is denoted by f(a) or fa or a/ and called the value of f at a. We 
denote a function f from M to N also by f: M > N, or by f:r+ t(2), 
provided f(x) = t(x) for some term t (see 2.2). ranf = {fx|xe M} 
is called the range of f, and dom f = M its domain. idjy denotes the 
identical function on M, that is, idys(x) = x for all x € M. 

f:M — Nis injective if fa = fy > x = y, for all x,y € M, surjective 
if ran f = N, and biyective if f is both injective and surjective. The reader 
should basically be familiar with this terminology. The phrase “let f be 
a function from M to N” is sometimes shortened to “let f: M— N.” 
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XX Notation 


The set of all functions from a set I to a set M is denoted by M/. If 
f,g are functions with rang C dom f then h:x +> f(g(x)) is called their 
composition (or product). It will preferably be written as h = fog. 

Let J and M be sets, f:I — M, and call I the index set. Then f will 
often be denoted by (a;);ez and is named, depending on the context, an 
(indexed) family, an I-tuple, or a sequence. If 0 is identified with @ and 
n > 0 with {0,1,...,n —1}, as is common in set theory, then M” can 
be understood as the set of n-tuples (ai)icn = (@o,---,@n—1) of length n 
whose members belong to M. In particular, M° = {@}. Also the set of 
sequences (@1,...,@n) with a; € M will frequently be denoted by M”. In 
concatenating finite sequences, which has an obvious meaning, the empty 
sequence (i.e., 0), plays the role of a neutral element. (a1,...,d@,) will 
mostly be denoted by a. Note that this is the empty sequence for n = 0, 
similar to {a1,...,@n} for n = 0 always being the empty set. f@ means 
f(a1,...,@n) throughout. 

If A is an alphabet, i.e., if the elements s € A are symbols or at least 
named symbols, then the sequence (s),...,5,) € A” is written as $1 ---Sp, 
and called a string or a word over A. The empty sequence is called in 
this context the empty string. A string consisting of a single symbol s is 
termed an atomic string. It will likewise be denoted by s, since it will be 
clear from the context whether s means a symbol or an atomic string. 

Let €7 denote the concatenation of the strings € and 7. If € = €:n&2 for 
some strings €,,€2 and 7 #@ then 7 is called a segment (or substring) of 
€, termed a proper segment in case 7 # €. If €; = 9 then 7 is called an 
initial, if 2 = 0, a terminal segment of €. 

Subsets P,Q, R,...C M” are called n-ary predicates of M or n-ary re- 
lations. A unary predicate will be identified with the corresponding subset 
of M. We may write Pd for @ € P, and —Pa for @ ¢ P. Metatheoretical 
predicates (or properties) cast in words will often be distinguished from 
the surrounding text by single quotes, for instance, if we speak of the 
syntactic predicate ‘The variable x occurs in the formula a’. We can do 
so since quotes inside quotes will not occur in this book. Single-quoted 
properties are often used in induction principles or reflected in a theory, 
while ordinary (“double”) quotes have a stylistic function only. 

An n-ary operation of M is a function f: M”" > M. Since M® = {0}, a 
0-ary operation of M is of the form {(@,c)}, with c € M; it is denoted by 
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c for short and called a constant. Each operation f:M” — M is uniquely 


described by the graph of f, defined as 
sraph f= 4 Giese) | Gi) KH oe 


Both f and graph f are essentially the same, but in most situations it is 
more convenient to distinguish between them. 

The most important operations are binary ones. The corresponding 
symbols are mostly written between the arguments, as in the following 
listing of properties of a binary operation o on a set A. 0: A? — A is 


commutative if acb=boa for alla,be A, 

associative if ao(boc) = (acb)oc for all a,b,c € A, 

idempotent if aca=a for alla€ A, 

invertible if for all a,b € A there are r,y € A 
with acx = band yoa=b. 


If H,© (read eta, theta) are expressions of our metalanguage, H = O 
stands for ‘H iff ©’ which abbreviates ‘H if and only if ©’. Similarly, 
H=>OandH & O mean ‘if H then O’ and ‘H and O’, respectively, and 
HV O is to mean ‘H or O.’ This notation does not aim at formalizing the 
metalanguage but serves improved organization of metatheoretic state- 
ments. We agree that > ©, ... separate stronger than linguistic binding 
particles such as “there is” or “for all.” Therefore, in the statement 

‘XFbka@XFa, forall X and alla’ (Theorem 1.4.6) 


the comma should not be dropped; otherwise, some serious misunder- 
standing may arise: ‘X F a for all X and all @’ is simply false. 

H := O means that the expression H is defined by 0. When integrating 
formulas in the colloquial metalanguage, one may use certain abbreviating 
notation. For instance, ‘a = § and @ = 7’ is occasionally shortened 
toa = GP =y. (‘the formulas a, 3, and G,y are equivalent’). This is 
allowed, since in this book the symbol = will never belong to the formal 
language from which the formulas a, 3,7 are taken. W.l1.0.g. or w.l.o.g. is 
a colloquial shorthand of “without loss of generality” used in mathematics. 


‘This means that the left-hand term graph f is defined by the right-hand term. A 
corresponding meaning has := throughout, except in programs and flow diagrams, 
where x := t means the allocation of the value of the term t to the variable x. 


Chapter 1 


Propositional Logic 


Propositional logic, by which we here mean two-valued propositional logic, 
arises from analyzing connections of given sentences A, B, such as 


A and B, Aor B, not A, if A then B. 


These connection operations can be approximately described by two- 
valued logic. There are other connections that have temporal or local 
features, for instance, first A then B or here A there B, as well as unary 
modal operators like it is necessarily true that, whose analysis goes beyond 
the scope of two-valued logic. These operators are the subject of tempo- 
ral, modal, or other subdisciplines of many-valued or nonclassical logic. 
Furthermore, the connections that we began with may have a meaning in 
other versions of logic that two-valued logic only incompletely captures. 
This pertains in particular to their meaning in natural or everyday lan- 
guage, where meaning may strongly depend on context. 

In two-valued propositional logic such phenomena are set aside. This 
approach not only considerably simplifies matters, but has the advantage 
of presenting many concepts, for instance those of consequence, rule in- 
duction, or resolution, on a simpler and more perspicuous level. This will 
in turn save a lot of writing in Chapter 2 when we consider the corre- 
sponding concepts in the framework of predicate logic. 

We will not consider everything that would make sense in two-valued 
propositional logic, such as two-valued fragments and problems of defin- 
ability and interpolation. The reader is referred instead to [KK] or [Ral]. 
We will concentrate our attention more on propositional calculi. While 
there exists a multitude of applications of propositional logic, we will not 
consider technical applications such as the designing of Boolean circuits 
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and problems of optimization. These topics have meanwhile been inte- 
grated into computer science. Rather, some useful applications of the 
propositional compactness theorem are described comprehensively. 


1.1 Boolean Functions and Formulas 


Two-valued logic is based on two foundational principles: the principle of 
bivalence, which allows only two truth values, namely true and false, and 
the principle of extensionality, according to which the truth value of a 
connected sentence depends only on the truth values of its parts, not on 
their meaning. Clearly, these principles form only an idealization of the 
actual relationships. 


Questions regarding degrees of truth or the sense-content of sentences 
are ignored in two-valued logic. Despite this simplification, or indeed 
because of it, such a method is scientifically successful. One does not even 
have to know exactly what the truth values true and false actually are. 
Indeed, in what follows we will identify them with the two symbols 1 and 
0. Of course, one could have chosen any other apt symbols such as T and 
1 ort and f. The advantage here is that all conceivable interpretations of 
true and false remain open, including those of a purely technical nature, 
for instance the two states of a gate in a Boolean circuit. 


According to the meaning of the word and, the conjunction A and B of 
sentences A, B, in formalized languages written as Aa B or A& B, is true 
if and only if A, B are both true and is false otherwise. So conjunction 
corresponds to a binary function or operation over the set {0,1} of truth 


values, named the a-function and denoted by a. It is given by its value 
lel 100 
0o1 000 
truth table of a binary function © with arguments and values in {0, 1}. 


matrix fe a where, in general, ( ) represents the value matrix or 


The delimiters of these small matrices will usually be omitted. 


A function f:{0,1}" — {0,1} is called an n-ary Boolean function or 
truth function. Since there are 2” n-tuples of 0,1, it is easy to see that 
the number of n-ary Boolean functions is 22". We denote their totality by 
By. While Bz has 24 = 16 members, there are only four unary Boolean 
functions. One of these is negation, denoted by — and defined by =1 = 0 
and =0 = 1. Bo consists just of the constants 0 and 1. 
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The first column of the table below contains the common binary connec- 
tions with examples of their instantiation in English. The second column 
lists some of its traditional symbols, which also denote the corresponding 
truth function, and the third its truth table. Disjunction is the inclusive 
or and is to be distinguished from the exclusive disjunction. The latter 
corresponds to addition modulo 2 and is therefore given the symbol +. 
In Boolean circuits the functions +,|,7 are often denoted by zor, nor, 
and nand; the latter is also known as the Sheffer function. Recall our 
agreement in the section Notation that the symbols & V, =, and © 
will be used only on the metatheoretic level. 


A connected sentence and its corresponding truth function need not be 
denoted by the same symbol; for example, one might take ~ for conjunc- 
tion and et as the corresponding truth function. But in doing so one would 
only be creating extra notation, but no new insights. The meaning of a 
symbol will always be clear from the context: if a, @ are sentences of a for- 
mal language, then a @ denotes their conjunction; if a, b are truth values, 
then ab just denotes a truth value. Occasionally, we may want to refer 
to the symbols ~,v,7,... themselves, setting their meaning temporarily 
aside. Then we talk of the connectives or truth functors A,V,7,... 


compound sentence symbol | truth table 


conjunction 


A and B; A as well as B 
disjunction 
AorB 
implication 
if A then B; B provided A 
equivalence 
A if and only if B; A iff B 
exclusive disjunction 
either A or B but not both 
nihilation 
neither A nor B 
incompatibility 


A, & 


v,V 


Se Ref O;]O ry rR Clr O;orF|,oo 


EF OC; C0 OC], rF CO] OC Kt kK Kf kK eK | oO K 


not at once A and B 
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Sentences formed using connectives given in the table are said to be 
logically equivalent if their corresponding truth tables coincide. This is the 
case, e.g., for the sentences A provided B and A or not B, which represent 
the converse implication, denoted by A — B.' It does not appear in the 
table, since it arises by swapping A, B in the implication. This and similar 
reasons explain why only a few of the sixteen binary Boolean functions 
require notation. Some other examples of logical equivalent sentences are 
if A and B then C, C provided A and B, and A and B only if C. 

In order to recognize and describe logical equivalence of compound sen- 
tences it is useful to create a suitable formalism or a formal language. The 
idea is basically the same as in arithmetic, where general statements are 
more clearly expressed by means of certain formulas. As with arithmetical 
terms, we consider propositional formulas as strings of signs built in given 
ways from basic symbols. Among these basic symbols are variables, for 
our purposes called propositional variables, the set of which is denoted by 
PV. Traditionally, these are symbolized by po,p1,... However, our num- 
bering of the variables below begins with p, rather than with po, enabling 
us later on to represent Boolean functions more conveniently. Further, we 
use certain logical signs such as A,v,—,..., similar to the signs +,-,... of 
arithmetic. Finally, parentheses ( , ) will serve as technical aids, although 
these are dispensable, as will be seen later on. 

Each time a propositional language is in question, the set of its logi- 
cal symbols, called the logical signature, and the set of its variables must 
be given in advance. For instance, it is crucial in some applications of 
propositional logic in Section 1.5 for PV to be an arbitrary set, and not 
a countably infinite one as indicated previously. Put concretely, we de- 
fine a propositional language F of formulas built up from the symbols 
(,),A,V575P1,p2,--. inductively as follows: 


(F1) The atomic strings pi,po,... are formulas, called prime formulas, 
also called atomic formulas, or simply prime. 

(F2) If the strings a, 3 are formulas, then so too are the strings (a {), 
(a v PB), and na. 


This is a recursive (somewhat sloppily also called inductive) definition 
in the set of strings on the alphabet of the mentioned symbols, that is, 


' Converse implication is used in the programming language PROLOG, see 4.6. 
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only those strings gained using (F1) or (F2) are in this context formulas. 
Stated set-theoretically, F is the smallest (i.e., the intersection) of all sets 
of strings S built from the aforementioned symbols with the properties 


(f1) pi,po,...€ S, (f2)a,B8ES => (ar§B),(av B),7ae€S. 


Example. (p; A (p2 v —p1)) is a formula. On the other hand, its initial 
segment (p; A(p2 Vv 7p) is not, because a closing parenthesis is missing. 
It is intuitively clear and will rigorously be proved on the next page that 
the number of left parentheses occurring in a formula coincides with the 
number of its right parentheses. 


Remark 1. (f1) and (f2) are set-theoretic translations of (F1) and (F2). Some 
authors like to add a third condition to (F1), (F2), namely (F3): No other strings 
than those obtained by (F1) and (F2) are formulas in this contert. But this at 
most underlines that (F1), (F2) are the only formula-building rules; (F3) follows 
from our definition, as its set-theoretic translation by (f1), (f2) indicates. Note 
that we do not strictly distinguish between the symbol p; and the prime formula 
or atomic string p;. Note also that in the formula definition parentheses are 
needed only for binary connectives, not if a formula starts with —. By a slightly 
more involved definition at least the outermost parentheses in formulas of the 
form (ao G) with a binary connective o could be saved. Howsoever propositional 
formulas are defined, what counts is their unique readability, see page 7. 


The formulas defined by (F1), (F2) are called Boolean formulas, because 
they are obtained using the Boolean signature {\,v,—}. Should further 
connectives belong to the logical signature, for example — or ©, (F2) of 
the above definition must be augmented accordingly. But unless stated 
otherwise, (a +) and (a + #) are here just abbreviations; the first is 
=(a@A-8), the second is ((a +8)A(G—-a)). 

Occasionally, it is useful to have symbols in the logical signature for 
always false and always true, 1 and T respectively, say, called falsum 
and verum and sometimes also denoted by 0 and 1. These are to be 
regarded as supplementary prime formulas, and clause (F1) should be 
altered accordingly. However, we prefer to treat 1 and T as abbreviations: 
i= (pi AA) Sd Tk, 

For the time being we let F be the set of all Boolean formulas, although 
everything said about F holds correspondingly for any propositional lan- 
guage. Propositional variables will henceforth be denoted by p,q,..., 
formulas by a, 3, 7,6,y%,-.-, prime formulas also by 7, and sets of propo- 
sitional formulas by X,Y, Z, where these letters may also be indexed. 
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For the reason of parenthesis economy in formulas, we set some conven- 
tions similar to those used in writing arithmetical terms. 


1. The outermost parentheses in a formula may be omitted (if there 
are any). For example, (p v qg)A7p may be written in place of 
((p v qg)Anp). Note that (p v q)A-p is not itself a formula but 
denotes the formula ((p v q) Ap). 


2. In the order =, A,v, +,<, each connective binds more strongly 
than those following it. Thus, one may write p v qA-p instead of 
pv (qAn7p), which means (p v (¢A-p)) by convention 1. 


3. By the multiple use of — we associate to the right. So p—+q—p 
is to mean p—(q—p). Multiple occurrences of other binary con- 
nectives are associated to the left, for instance, p~aqA7p means 
(pAq)Anp. In place of agA +++ AQn and agv:::Va, we may write 
Ni<n a; and \/; <n Oi, respectively. 


Also, in arithmetic, one normally associates to the left. An exception is 
the term x”, where traditionally association to the right is used, that is, 
x” equals 2). Association to the right has some advantages in writing 
tautologies in which — occurs several times; for instance in the examples 
of tautologies listed in 1.3 on page 18. 

The above conventions are based on a reliable syntax in the framework 
of which intuitively clear facts, such as the identical number of left and 
right parentheses in a formula, are rigorously provable. These proofs are 
generally carried out using induction on the construction of a formula. To 
make this clear we denote by €y that a property € holds for a string y. 
For example, let € mean the property ‘y is a formula that has equally 
many right- and left-hand parentheses’. € is trivially valid for prime 
formulas, and if €a, €3 then clearly also E(an), E(a v GB), and E-a. 
From this we may conclude that € applies to all formulas, our reasoning 
being a particularly simple instance of the following 
Principle of formula induction. Let € be a property of strings that 
satisfies the conditions 


(0) Ex for all prime formulas x, 
(s) Ea,EB > E(anB),E(av PB), Ena, for alla, BE F. 
Then Ev holds for all formulas yp. 
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The justification of this principle is straightforward. The set S of all 
strings with property € has, thanks to (o) and (s), the properties (f1) 
and (f2) on page 5. But F is the smallest such set. Therefore, F C S. 
In words, € applies to all formulas y. Clearly, if other connectives are 
involved, condition (s) must accordingly be modified. 

It is intuitively clear and easily confirmed inductively on y that a com- 
pound Boolean formula ¢ (i.e., y is not prime) is of the form y = 7a 
or yp = (aa) or y = (a v 8B) for suitable a, 3 € F. Moreover, this de- 
composition is unique. For instance, (aA) cannot at the same time be 
written (a’ v 2’) with perhaps different formulas a’, 3’. Thus, compound 
formulas have the unique readability property, more precisely, the 
Unique formula reconstruction property. Each compound formula 
yp € F is either of the form 7a or (ao B) for some uniquely determined 
formulas a, 3 € F, where o is either a orv. 

This property is less obvious than it might seem. Nonetheless, the proof 
is left as an exercise (Exercise 4) in order to maintain the flow of things. It 
may be a surprise to the novice that for the unique formula reconstruction, 
parentheses are dispensable throughout. Indeed, propositional formulas, 
like arithmetical terms, can be written without any parentheses; this is 
realized in Polish notation (= PN), also called prefix notation, once widely 
used in the logic literature. The idea consists in altering (F2) as follows: 
if a, 8 are formulas then so too are naB, va, and 7a. Similar to PN is 
RPN (reverse Polish notation), still used in some programming languages 
like PostScript. RPN differs from PN only in that a connective is placed 
after the arguments. For instance, (pA(q v 7p)) is written in RPN as 
pqp-v \. Reading PN or RPN requires more effort due to the high density 
of information; but by the same token it can be processed very fast by a 
computer or a high-tech printer getting its job as a PostScript program. 
The only advantage of the parenthesized version is that its decoding is 
somewhat easier for our eye through the dilution of information. 

Intuitively it is clear what a subformula of a formula ¢ is; for example, 
(qAnp) is a subformula of (p v (qA-p)). All the same, for some pur- 
poses it is convenient to characterize the set Sf of all subformulas of y 
inductively: 

Sf = {a} for prime formulas 7; Sfra=SfaU {-a}, 
Sf(ao 8) =SfaUSf GU {(ao B)} for a binary connective o. 
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Thus, a formula is always regarded as a subformula of itself. The above is 
a typical example of a recursive definition on the construction of formulas. 
Another example of such a definition is the rank rk y of a formula y, which 
provides a sometimes more convenient measure of the complexity of y 
than its length as a string and occasionally simplifies inductive arguments. 
Intuitively, rk y is the highest number of nested connectives in y. Let 
rk a = O for prime formulas 7, and if rka@ and rk are already defined, 
then rk sa = rka+1 and rk(aoZ) = max{rka,rk 3}+1. Here o denotes 
any binary connective. We will not give here a general formulation of 
this definition procedure because it is very intuitive and similar to the 
well-known procedure of recursive definitions on N. It has been made 
sufficiently clear by the preceding examples. Its justification is based 
on the unique reconstruction property and insofar not quite trivial, in 
contrast to the proof procedure by induction on formulas that immediately 
follows from the definition of propositional formulas. 

If a property is to be proved by induction on the construction of formulas 
vy, we will say that it is a proof by induction on y. Similarly, the recursive 
construction of a function f on F will generally be referred to as defining 
f by recursion on y, often somewhat sloppily paraphrased as defining f 
by induction on vy. Examples are Sf and rk. Others will follow. 

Since the truth value of a connected sentence depends only on the truth 
values of its constituent parts, we may assign to every propositional vari- 
able of a a truth value rather than a sentence, thereby evaluating a, 
ie., calculating a truth value. Similarly, terms are evaluated in, say, 
the arithmetic of real numbers, whose value is then a real (= real num- 
ber). An arithmetical term ¢ in the variables 21,...,2,, describes an 
n-ary function whose arguments and values are reals, while a formula y 
in p1,.--,Pn describes an n-ary Boolean function. To be precise, a propo- 
sitional valuation, or alternatively, a (propositional) model, is a mapping 
w: PV — {0,1} that can also be understood as a mapping from the set 
of prime formulas to {0,1}. We can extend this to a mapping from the 
whole of F to {0,1} (likewise denoted by w) according to the stipulations 

(*) wlarB)=wanwB; wav B)=wavw; wraa=-wa.? 
By the value wy of a formula y under a valuation w: PV — {0,1} 


? We often use (+) or (*) as a temporary label for a condition (or property) that we 
refer back to in the text following the labeled condition. 
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we mean the value given by this extension. We could denote the ex- 
tended mapping by w, say, but it is in fact not necessary to distinguish 
it symbolically from w: PV — {0,1} because the latter determines the 
extension uniquely. Similarly, we keep the same symbol if an operation 
in N extends to a larger domain. If the logical signature contains further 
connectives, for example —, then (*) must be supplemented accordingly, 
with w(a +2) = wa —w/Z in the example. However, if — is defined as 
in the Boolean case, then this equation must be provable. Indeed, it is 
provable, because from our definition of a — @ it follows that 


w(a > B) = w7(a a AB) = aw(a a aB) = 7(wa a aw) = wa > wB, 


for any w. A corresponding remark could be made with respect to 
and to T and 1. Always wT = 1 and wt = 0 by our definition of T, 1, 
in accordance with the meaning of these symbols. However, if these or 
similar symbols belong to the logical signature, then suitable equations 
must be added to the definition of w. 


Let F,, denote the set of all formulas of F in which at most the variables 
P1,+-+;Pn occur (n > 0). Then it can easily be seen that wa for the 
formula a € F, depends only on the truth values of p1,...,Pn. In other 
words, a € F,, satisfies for all valuations w, w’, 


(x) wa =w'a whenever wp; = w'p; for i=1,...,n. 


The simple proof of (x) follows from induction on the construction of for- 
mulas in F,, observing that these are closed under the operations 7, A, v. 
Clearly, (x) holds for p € F,, and if (x) is valid for a, 3 € Fn, then also 
for =a, aAZ, and a v @. It is then clear that each a € F,, defines or 
represents an n-ary Boolean function according to the following 


Definition. a € F, represents the function f € B,, (or f is represented 
by a) whenever wa = fwp (:= f(wpi,...,wWpn)) for all valuations w. 


Because wa for a € Fp, is uniquely determined by wp,...,wpn, @ 
represents precisely one function f € By, sometimes written as a”, For 
instance, both py; Ap2 and 7(-p; v pz) represent the a-function, as can 
easily be illustrated using a table. Similarly, =p; v po and 7(p;A-p2) 
represent the —-function, and pi: v p2, >(>p1A7p2), (p1 > p2) > pe all 
represent the v-function. Incidentally, the last formula shows that the 
v-connective can be expressed using implication alone. 
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There is a caveat though: since @ = p, v po, for instance, belongs 
not only to Fz but to F3 as well, a also represents the Boolean function 
f:(@1, 22,23) ++ Z1Vvx2. However, the third argument is only “fictional,” 
or put another way, the function f is not essentially ternary. 

In general we say that an operation f: M" — M is essentially n-ary if f 
has no fictional arguments, where the ith argument of f is called fictional 
whenever for all 21,...,2j,...%n € M and all x, € M, 


Tl Piss 5 Pasian) SF Gigos sgl tain) 


Identity and the —-function are the essentially unary Boolean functions, 
and out of the sixteen binary functions, only ten are essentially binary, as 
is seen in scrutinizing the possible truth tables. 

Remark 2. If a, denotes temporarily the number of all n-ary Boolean func- 
tions and e, the number of all essentially n-ary Boolean functions, it is not 
particularly difficult to prove that a, = ye i) e;. Solving for e, results in 


en = en ee aes However, we will not make use of these equations. 
These become important only in a more specialized study of Boolean functions. 


Exercises 


1. f € By is called linear if f(a1,...,%n) = ao +0141 + +++ + Gn2p for 
suitable coefficients ao,...,@n € {0,1}. Here + denotes exclusive 
disjunction (addition modulo 2) and the not written multiplication 
is conjunction (i.e., a;2; = x; for a; = 1 and a;x; = 0 for a; = 0). 
(a) Show that the above representation of a linear function f is 
unique. (b) Determine the number of n-ary linear Boolean functions. 
(c) Prove that each formula a in =,+ (i.e., a is a formula of the 
logical signature {—,+}) represents a linear Boolean function. 


2. Verify that a compound Boolean formula vy is either of the form 
yp = 7a or else y = (aA) or y = (av B) for suitable formulas a, 3 
(this is the easy part of the unique reconstruction property). 


3. Prove that a proper initial segment of a formula y is never a formula. 
Equivalently: If a€ = 6n with a,@ © F and arbitrary strings €,7, 
then a = @. The same holds for formulas in PN, but not in RPN. 


4. Prove (with Exercise 3) the second more difficult part of the unique 
reconstruction property, the claim of uniqueness. 
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1.2 Semantic Equivalence and Normal Forms 


Throughout this chapter w will always denote a propositional valuation. 
Formulas a, are called (logically or semantically) equivalent, and we 
write a = 2, when wa = wf for all valuations w. For example a = —-7a. 
Obviously, a = ( iff for any n such that a, @ © F,, both formulas represent 
the same n-ary Boolean function. It follows that at most 2?” formulas in 
F, can be pairwise inequivalent, since there are no more than 2?” n-ary 
Boolean functions. 


In arithmetic one writes simply s = t to express that the terms s,t rep- 
resent the same function. For example, (a+ y)? = x? +2xry+y? expresses 
the equality of values of the left- and right-hand terms for all z,y € R. 
This way of writing is permissible because formal syntax plays a minor role 
in arithmetic. In formal logic, however, as is always the case when syntac- 
tic considerations are to the fore, one uses the equality sign in messages 
like a = £ only for the syntactic identity of the strings a and @. There- 
fore, the equivalence of formulas must be denoted differently. Clearly, for 
all formulas a, 3,7 the following equivalences hold: 


ar(Bay) =arBay, av(Bvy) =avBvy_ (associativity); 
anB = Baa, avB= Bva (commutativity); 
ANd = a, ava=a (idempotency); 
ar(av B) =a, ava\sg=a (absorption); 
an(Bvy) = anBvany, (A-distributivity); 
av Bay = (avB)a(avy) (v-distributivity); 
A(arB) = 7mav 7B, =(av B) = 7aavn7G8 (de Morgan rules). 


Furthermore, a v n@ =T, @A7Aa@= 1, andaaT=avi=a. It is also 
useful to list certain equivalences for formulas containing —, for example 
the frequently used a > 3 = 7a v B (= 7(a@A-7fG), and the important 

a8 +7 = 08 += B04. 

To generalize: ay > ++: ~Qpn = Q1 A+++ AQn-1 7 Qn. Further, we men- 
tion the “left distributivity” of implication with respect to ~ and v, namely 
a-Bray=(a>B)r(a>y7); arhvy=(a-B) v (a7). 

Should the symbol — lie to the right then the following are valid: 


arnprays(aryv(Boy); avBoys(asya(s—y). 
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Remark 1. These last two logical equivalences are responsible for a curious 
phenomenon in everyday language. For example, the two sentences 


A: Students and pensioners pay half price, 
B: Students or pensioners pay half price 


evidently have the same meaning. How to explain this? Let student and pen- 
sioner be abbreviated by S, P, and pay half price by H. Then 


a: (S>H),(P—-4), GB: (SvP)-H 

express somewhat more precisely the factual content of A and B, respectively. 
Now, according to our truth tables, the formulas a and £ are simply logically 
equivalent. The everyday-language statements A and B of a and ( obscure the 
structural difference of a and ( through an apparently synonymous use of the 
words and and or. 

Obviously, = is an equivalence relation, that is, 

a=a (reflexivity), 
a=B > B=a (symmetry), 
a=8,8=7 > a=y (transitivity). 
Moreover, = is a congruence relation on F,° i.e., for all a, a’, 3, 8", 
a=a,B=C3' > acB=a'of'",7-a=-a! (o€ {n, v }). 

For this reason the replacement theorem holds: a = a! => 9g = Y, 
where y’ is obtained from y by replacing one or several of the possi- 
ble occurrences of the subformula @ in y by a’. For instance, by re- 
placing the subformula sp v 7q by the equivalent formula —=(paq) in 
yp = (ap v 7q) A(p v q) we obtain y! = 7(prq)a(p v q), which is equiva- 
lent to y. A similar replacement theorem also holds for arithmetical terms 
and is constantly used in their manipulation. This mostly goes unnoticed, 
because = is written instead of =, and the replacement for = is usually 
correctly applied. The simple inductive proof of the replacement theorem 
will be given in a somewhat broader context in 2.4. 

Furnished with the equivalences ~7a@ = a, 7=(aA) = 7a v 7G, and 
a(a v 3) = 7a, and using replacement it is easy to construct for each 
formula an equivalent formula in which — stands only in front of variables. 


For example, =(prq v r) = 7(paq) Arr = (=p v 7g) A77r is obtained in 
this way. This observation follows also from Theorem 2.1. 


3This concept, stemming originally from geometry, is meaningfully defined in every 
algebraic structure and is one of the most important and most general mathematical 
concepts; see 2.1. The definition is equivalent to the condition 


a=a => aop=a' of, Boa=Boad',7a=-<’, for all a,a’, B. 
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It is always something of a surprise to the newcomer that independent of 
its arity, every Boolean function can be represented by a Boolean formula. 
While this can be proved in various ways, we take the opportunity to 
introduce certain normal forms and therefore begin with the following 


Definition. Prime formulas and negations of prime formulas are called 
literals. A disjunction a,v---V@,, where each a; is a conjunction of liter- 
als, is called a disjunctive normal form, a DNF for short. A conjunction 
By A+++ A Bn, where every (; is a disjunction of literals, is called a con- 
junctive normal form, a CNF for short. 


Example 1. The formula p v (¢A 7p) is a DNF; pv q is at once a DNF 
and a CNF; pv 7(qa7p) is neither a DNF nor a CNF. 


Theorem 2.1 states that every Boolean function is represented by a 
Boolean formula, indeed by a DNF, and also by a CNF. It would suffice 
to show that for given n there are at least 2?" pairwise inequivalent DNFs 
(resp. CNFs). However, we present instead a constructive proof whereby 
for a Boolean function given in tabular form a representing DNF (resp. 
CNF) can explicitly be written down. In Theorem 2.1 we temporarily 
use the following notation: p! := p and p® := 7p. With this stipulation, 
w(pr i Aps?) = 1 iff wp, = 2 and wp2 = x2. More generally, induction 
on n > 1 easily shows that for all 71,...,2 € {0,1}, 


In 


(4) wip Ao Ape) = 1S wp 2 iby why = ij a = Sw) 


Theorem 2.1. Every Boolean function f with f € By, (n > 0) is repre- 
sentable by a DNF, namely by 


api= Vo pyta-: Apin4 
f#=1 
At the same time, f is representable by the CNF 
eee ae ee 
fz=0 
Proof. By the definition of ay, the following equivalences hold for an 
arbitrary valuation w: 


‘The disjuncts of af can be arranged, for instance, according to the lexicographical 
order of the n-tuples (11,...,2n) € {0,1}”. If the disjunction is empty (that is, if f 
does not take the value 1) let ay be 1 (= pi1a-7p1). Thus, the empty disjunction is 
1. Similarly, the empty conjunction equals T (= 41). These conventions correspond 
to those in arithmetic, where the empty sum is 0 and the empty product is 1. 
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waf=1 © there is an ¢ with f¢=1 and w(py'a--: Apzr) =1 
= there is an ¢ with ff =1 and wp = (by (*)) 
= fwp=1 (replace Z% by wp). 
Thus, war = 1 & fwp = 1. From this equivalence, and because there 
are only two truth values, war = fwp follows immediately. The repre- 
sentability proof of f by @f runs analogously; alternatively, Theorem 2.4 
below may be used. [J 


Example 2. For the ezclusive-or function +, the construction of af 
in Theorem 2.1 gives the representing DNF p,; Apo v —p; \p2, because 
(1,0), (0,1) are the only pairs for which + has the value 1. The CNF given 
by the theorem, on the other hand, is (p; v pz) A(—p1 v —p2); the equiv- 
alent formula (p, v p2) A7(p1 A p2) makes the meaning of the exclusive-or 
compound particularly intuitive. 


Pi Ap2 V Api A p2 V Api A7p2 is the DNF given by Theorem 2.1 for the 
Boolean function —. It is longer than the formula 4p; v pg, which is 
also a representing DNF. But the former is distinctive in that each of its 
disjuncts contains each variable occurring in the formula exactly once. 
A DNF of n variables with the analogous property is called canonical. 
The notion of canonical CNF is correspondingly explained. For instance, 
the function < is represented by the canonical CNF (-p,vp2) A (piv—p2) 
according to Theorem 2.1, which always provides canonical normal forms 
as representing formulas. 

Since each formula represents a certain Boolean function, Theorem 2.1 
immediately implies the following fact, which has also a (more lengthy) 
syntactical proof with the replacement theorem mentioned on page 12. 


Corollary 2.2. Each py € F is equivalent to a DNF and to a CNF. 


Functional completeness. A logical signature is called functional 
complete if every Boolean function is representable by a formula in this 
signature. Theorem 2.1 shows that {=,~, v } is functional complete. 
Because of p v gq = 7(—pa-7q) and paq = -(—p v 7q), one can further 
leave aside v, or alternatively ~. This observation is the content of 


Corollary 2.3. Both {3,} and {7, v} are functional complete. 


Therefore, to show that a logical signature L is functional complete, it 
is enough to represent =, A or else 4, v by formulas in L. For example, 
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because =p = p +0 and pag = 7(p 779), the signature { > ,0} is func- 
tional complete. On the other hand, { >, ~, v }, and a fortiori { — }, are 
not. Indeed, wy = 1 for any formula y in —,a, v and any valuation w 
such that wp = 1 for all p. This can readily be confirmed by induction on 
y. Thus, never =p = y for any such formula y. 

It is noteworthy that the signature containing only | is functional com- 
plete: from the truth table for | we get ap = p| pas well as pag = 7p 74. 
Likewise for {1}, because =p = ptp and pv q=—-ptn7gq. That {1} must 
necessarily be functional complete once we know that {| } is will become 
obvious in the discussion of the duality theorem below. Even up to term 
equivalence, there still exist infinitely many signatures. Here signatures 
are called term equivalent if the formulas of these signatures represent the 
same Boolean functions as in Exercise 2, for instance. 

Define inductively on the formulas from F¥ a mapping 6: F — F by 


p=p, (na)’=-a%, (anf) =aPv A, (av BP =aPrf?. 


a? is called the dual formula of a and is obtained from a simply by inter- 


6 


changing A and v. Obviously, for a DNF a, a° is a CNF, and vice versa. 
Define the dual of f € Bn by f°# := Afaz with Az := (721,..., 72). 
Clearly fo := (f°)? = f since (f°)°# = -Afz = f#. Note that 
Ai =v, vio = a, O8= +4, 19 = 7, but —° ==. In other words, - is self- 
dual. One may check by going through all truth tables that essentially 
binary self-dual Boolean functions do not exist. But it was Dedekind who 


discovered the interesting ternary self-dual function 
dg: (@1,%2,2%3) > ©1AX2 V 11 ALB V LQAZ3. 
The above notions of duality are combined in the following 
Theorem 2.4 (The duality principle for two-valued logic). [f a 


represents the function f then a® represents the dual function f°. 


Proof by induction on a. Trivial for a = p. Let a,@ represent fi, fo, 
respectively. Then af represents f : Zt f 2A fox, and in view of the 
induction hypothesis, (a~3)*> = a® v 6° represents g : fiz Vv fz. 
This function is just the dual of f because 


PPB = Afr = Afi frr#) = AfivE v for = fev [$F = gf. 


The induction step for v is similar. Now let @ represent f. Then 7a 


represents af :Z ++ af#. By the induction hypothesis, a® represents f°. 
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Thus (=a)° = 7a represents = f°, which coincides with (=f)* because of 
(af) = (nt = (fra) = (f°). 


For example, we know that « is represented by pag v ~pa-7q. Hence, 


by Theorem 2.4, + (= °) is represented by (pvq) \(—pv7q). More gener- 
ally, if a canonical DNF a represents f € By, then the canonical CNF a? 
represents f°. Thus, if every f € By, is representable by a DNF then ev- 
ery f must necessarily be representable by a CNF, since f + f° maps Bn 
bijectively onto itself as follows from ia = f. Note also that Dedekind’s 
just defined ternary self-dual function d3 shows in view of Theorem 2.4 
that paqv parv qar = (pv Qa(pvr)a(qvr). 

Remark 2. {~,v,0,1} is maximally functional incomplete, that is, if f is any 
Boolean function not representable by a formula in ~,v,0,1, then {~,v,0,1, f} 
is functional complete (Exercise 4). As was shown by E. Post (1920), there are up 
to term equivalence only five maximally functional incomplete logical signatures: 
besides {a ,v,0,1} only {-, a}, the dual of this, {¢,-}, and {d3,-}. The 
formulas of the last one represent just the self-dual Boolean functions. Since 
ap = 1+ :, the signature {0,1,+,-} is functional complete, where - is written 
in place of ~. The deeper reason is that {0,1,+,-} is at the same time the 
extralogical signature of fields (see 2.1). Functional completeness in the two- 
valued case just derives from the fact that for a finite field, each operation on 
its domain is represented by a suitable polynomial. We mention also that for 
any finite set M of truth values considered in many-valued logics there is a 
generalized two-argument Sheffer function, by which every operation on M can 
be obtained, similarly to + in the two-valued case. 


Exercises 
1. Verify the logical equivalences 
(p > q1) (7p > 42) = PAG VY “PAQ, 
PiAG > P2 VG = (pi > 2) v (G1 > 4). 


2. Show that the signatures {+,1}, {+, 4}, {<,0}, and {<, +} are all 
term equivalent. The formulas of each of these signatures represent 
precisely the linear Boolean functions. 


3. Show that the formulas in ~,v,0,1 represent exactly the monotonic 
Boolean functions. These are the constants from Bo, and for n > 0 
the f € By such that for all 7 with 1 <i<n, 


iis par ,€j—-1,0, €i41, Selecta Ba < F(z4, see, Di-1, 1, Vi41,. ete , in): 
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4. Show that the logical signature {~,v,0,1} is maximally functional 
incomplete. 


5. If one wants to prove Corollary 2.2 syntactically with the properties 
of = (page 11) one needs generalizations of the distributivity, e.g., 


Vien % A Viem Fi = Vien, j<m(oi% 9;). Verify the latter. 


1.3. Tautologies and Logical Consequence 


Instead of wa = 1 we prefer from now on to write w F a and read this w 
satisfies a. Further, if X is a set of formulas, we write wF X ifwEa 
for all a € X and say that w is a (propositional) model for X. A given a 
(resp. X) is called satisfiable if there is some w with w F a (resp. w F X). 
F, called the satisfiability relation, evidently has the following properties: 


wEp & wp=1 (pe PV); wEna & wFa; 
wFaaB & wFEaandwFf; weave & wEaorwF B. 
One can define the satisfiability relation w F a for a given w: PV — {0,1} 


also inductively on a, according to the clauses just given. This approach 
is particularly useful for extending the satisfiability conditions in 2.3. 


It is obvious that w: PV — {0,1} will be uniquely determined by setting 


down in advance for which variables w F p should be valid. Likewise the 
notation w F a for a € Fy, is already meaningful when w is defined only 
for p1,---,Pn- One could extend such a w to a global valuation by setting, 
for instance, wp = 0 for all unmentioned variables p. 


For formulas containing other connectives the satisfaction conditions 
are to be formulated accordingly. For example, we expect 
(4) wFEa-6 © ifwEathenwEF @. 
If — is taken to be a primitive connective, (*) is required. However, we 
defined — in such a way that (*) is provable. 


Definition. a is called logically valid or a (two-valued) tautology, in short 
F a, whenever w F aq for all valuations w. A formula not satisfiable at all, 
ie. wF a for all w, is called a contradiction. 


Examples. p v 7p is a tautology and so is a v 7a for every formula a, 
the so-called law of the excluded middle or the tertium non datur. On the 
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other hand, aj7a and a + 7a are always contradictions. The following 
tautologies in — are mentioned in many textbooks on logic. Remember 
our agreement about association to the right in formulas in which — 
repeatedly occurs. 

pp 

(p> 4) >(Q>7) > (p>) 

(p qr) +(q>p-r) 

PpP7qd7>P 

(p>q—>7r) >(p>q) >(p>r) 

((p +9) +p) +P 


(self-implication), 
(chain rule), 

(exchange of premises), 
(premise charge), 
(Frege’s formula), 
(Peirce’s formula). 


It will later turn out that all tautologies in — alone are derivable (in a 
sense still to be explained) from the last three formulas. 


Clearly, it is decidable whether a formula a is a tautology, in that one 
tries out the valuations of the variables of a. Unfortunately, no essentially 
more efficient method is known; such a method exists only for formulas 
of a certain form. We will have a somewhat closer look at this problem 
in 4.3. Various questions such as checking the equivalence of formulas 
can be reduced to a decision about whether a formula is a tautology. For 
notice the obvious equivalence of a= 6 andF a+ £. 


Basic in propositional logic is the following 


Definition. a is a logical consequence of X, written X F a, if w F a for 


every model w of X. In short, w FE X => wa, for all valuations w. 


While we use F both as the symbol for logical consequence (which is a 
relation between sets of formulas X and formulas qa) and the satisfiability 
property, it will always be clear from the context what F actually means. 
Evidently, a@ is a tautology iff @ F a, so that F a can be regarded as an 


abbreviation for J F a. 

In this book, X F a,@ will always mean ‘X F a and X F #’. More 
generally, X F Y is always to mean ‘X F @ for all G € Y’. We also write 
throughout aj,...,Q@, — @ in place of {a1,...,a,} F 8, and more briefly, 
X,aF @ in place of X U{a} F B. 

Examples of logical consequence. (a) a,@ - avZ and aaZF a, /. 
This is evident from the truth table of ~. (b) a,a—G F £, because 
1-2 =1-> 2=1 according to the truth table of —. 
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(c) X FL = X Fa for each a. Indeed, X F 1 = py Ap, obviously 
means that X is unsatisfiable (has no model), as e.g. X = {pe2, 7p2}. 

(d) X,aFGB& X,7aF- GB => XE BZ. In order to see this let w F X. 
If w F a then X,aF @ and hence w F #, and if w F a (ie, w F 7a) 
then w F 6 clearly follows from X,7aF (3. Note that (d) reflects our case 


distinction made in the naive metatheory while proving (d). 


Example (a) could also be stated as X F a,8 = X F anf. The 
property exemplified by (b) is called the modus ponens when formulated 
as a rule of inference, as will be done in 1.6. Example (d) is another 
formulation of the often-used procedure of proof by cases: In order to 
conclude a sentence @ from a set of premises X it suffices to show it to be 
a logical consequence both under an additional supposition and under its 
negation. This is generalized in Exercise 3. 


Important are the following general and obvious properties of 
(R) aEexX>XFa (reflexivity), 
(M) XFa&XCX' => X'EFa_ (monotonicity), 
(T) XFY&YFa => XFa (transitivity). 


Useful for many purposes is also the closure of the logical consequence 


relation under substitution, which generalizes the fact that from p v ap 

all tautologies of the form a@ v 7a arise from substituting a for p. 

Definition. A (propositional) substitution is a mapping 0 : PV ~F 

that is extended in a natural way to a mapping o : F — GF as follows: 
(arn B)? =a? rnB?, (av B)? =a’ v B?, (na)? =-a’. 

Thus, like valuations, substitutions are considered as operations on the 
whole of ¥. For example, if p’ = a for some fixed p and q? = q otherwise, 
then y® arises from y by substituting a for p at all occurrences of p in 
vy. From p v 7p arises in this way the schema a v =a. For X C § let 
X° := {y? |p € X}. The observation F y > F wy? turns out to be the 
special instance X = @) of the useful property 

(S) XFa => X° Fa’ (substitution invariance). 
In order to verify (S), define w® for a given valuation w in such a way 
that w°p = wp’. We first prove by induction on a that 


(x) wEat’ & w7Fa. 
If a is prime, (*) certainly holds. As regards the induction step, note that 
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wF (arB)? Swka’ap? Sweka’, Bp? 
&w’ Fa, (induction hypothesis) 
Swe Fans. 
The reasoning for v and — is analogous and so (*) holds. Now let X Fa 


and wF X°. By (*), we get w7 F X. Thus w? F a, and again by (x), 
wF a?. This confirms (S). Another important property of F that is not 


so easily obtained will be proved in 1.4, namely 
(F) XEa => Xo Fa for some finite subset Xo C X. 


— shares the properties (R), (M), (T), and (S) with almost every clas- 
sical or nonclassical propositional consequence relation. This is to mean a 
relation - between sets of formulas and formulas of an arbitrary proposi- 
tional language F that has the properties corresponding to (R), (M), (T), 
and, as a rule, (S). These properties are the starting point for a very gen- 
eral theory of logical systems created by Tarski, which underpins nearly all 
logical systems considered in the literature. Should F satisfy the property 
corresponding to (F) then | is called finitary. 


Remark 1. Sometimes (S) is not demanded in defining a consequence relation, 
and if (S) holds, one speaks of a structural consequence relation. We omit this 
refinement. Notions such as tautology, consistency, maximal consistency, and 
so on can be used with reference to any consequence relation | in an arbitrary 
propositional language F. For instance, a set of formulas X is called consistent 
int whenever X ¥ a for some a, and maximally consistent if X is consistent but 
has no proper consistent extension. F itself is called consistent if X ¥ a for some 
X and a (this is equivalent to not | a for all a). Here as always, | a stands for 
t+ a. If ¥ contains — then the consistency of X is often defined by X + a, 7a for 
no a. But the aforementioned definition has the advantage of being completely 
independent of any assumption concerning the occurring connectives. Another 
example of a general definition is this: A formula set X is called deductively 
closed in + provided X | a > a € X, for all a € F. Because of (R), this 
condition can be replaced by X fF a & a € X. Examples in F are the set of 
all tautologies and the whole of F. The intersection of a family of deductively 
closed sets is again deductively closed. Hence, each X C GF is contained in a 
smallest deductively closed set, called the deductive closure of X in F. It equals 
{a € ¥| X F a}, as is easily seen. The notion of a consequence relation can 
also be defined in terms of properties of the deductive closure. We mention that 
(F) holds not just for our relation F that is given by a two-valued matrix, but 
for the consequence relation of any finite logical matrix in any propositional 
language. This is stated and at once essentially generalized in Exercise 3 in 5.7 
as an application of the ultraproduct theorem. 
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A special property of the consequence relation F, easily provable, is 
(D) X,aFB> XFa—B8, 

called the (semantic) deduction theorem for propositional logic. To see 
this suppose X,a@F ( and let w be a model for X. If w F a then by the 
supposition, w F 6, hencewF a>. If wF a then w FE a8 as well. 
Hence X F a —( in any case. This proves (D). As is immediately seen, 
the converse of (D) holds as well, that is, one may replace > in (D) by 
<=. Iterated application of this simple observation yields 


Q1,---,Q€,nF BSE ay mag 7+: SAn MHB SE A AQAQA+ ++ AAn =f. 


In this way, (’s being a logical consequence of a finite set of premises is 
transformed into a tautology. Using (D) it is easy to obtain tautologies. 
For instance, to prove F p —q — 7, it is enough to verify p F q —p, for 
which it in turn suffices to show that p,q F p, and this is trivial. 

Remark 2. By some simple applications of (D) each of the tautologies in the 
examples on page 18 can be obtained, except the formula of Peirce. As we shall 


see in Chapter 2, all properties of F derived above and in the exercises will carry 
over to the consequence relation of a first-order language. 


Exercises 


1. Use the deduction theorem as in the text in order to prove 


(a) Pipsg—r) (og) pr), 
(b) P= Qo @+r)—@>r) 


2. Suppose that X F a +. Prove that X F (y >a) +(7 — 8). 


3. Verify the (rule of) disjunctive case distinction: if X,a F y and 
X,G8F ythen X,av GF +¥. This implication is traditionally written 
more suggestively as 


X,aky|X,BF 4 
X,avBey © 


4. Verify the rules of contraposition (notation as in Exercise 3): 
X,a@F B _  X,7aBF ma 
X,ABE-a X,ak B 
5. Let + be a consequence relation and let X be maximally consistent 
in F (see Remark 1). Show that X is deductively closed in F. 
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1.4 A Calculus of Natural Deduction 


We will now define a derivability relation / by means of a calculus op- 
erating solely with some structural rules. | turns out to be identical to 


the consequence relation F. The calculus ' is of the so-called Gentzen 
type and its rules are given with respect to pairs (X,qa) of formulas X 


and formulas a. Another calculus for F, of the Hilbert type, will be con- 
sidered in 1.6. In distinction to |Ge], we do not require that X be finite; 
our particular goals here make such a restriction dispensable. If + applies 
to the pair (X,qa) then we write X - a and say that a is derivable or 
provable from X (made precise below); otherwise we write X ¥ a. 

Following [K11], Gentzen’s name for (X,a), Sequenz, is translated as 
sequent. The calculus is formulated in terms of ~,— and encompasses the 
six rules below, called the basic rules. How to operate with these rules will 
be explained afterwards. The choice of {,—} as the logical signature is 
a matter of convenience and justified by its functional completeness. The 
other standard connectives are introduced by the definitions 


av B= 7(7aA78), a +8 = A(annB), «4 B= (a +8) A(B a). 


T,1 are defined as on page 5. Of course, one could choose any other 
functional complete signature and adapt the basic rules correspondingly. 
But it should be observed that a complete calculus in =, A,v, —, say, 
must also include basic rules concerning v and —, which makes induction 
arguments on the basic rules of the calculus more lengthy. 

Each of the basic rules below has certain premises and a conclusion. 
Only (IS) has no premises. It allows the derivation of all sequents at a. 
These are called the initial sequents, because each derivation must start 
with these. (MR), the monotonicity rule, could be weakened. It becomes 
even provable if all pairs (X,a) with a € X are called initial sequents. 


= ») 
(IS) ar (initial sequent) (MR) Ya (X" 2 X); 
XEa,B XFang 
1 2 
ang 7) rae 
XFa,7a X,at 8 | X,n7nat B 
Cl) YF B i) XFB 
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Here and in the following X  a,@ is to mean X F a and X F GZ. This 
convention is important, since X - a, has another meaning in Gentzen 
calculi that operate with pairs of sets of formulas. The rules (1) and 
(=1) actually have two premises, just like (42). Note further that (2) 
really consists of two subrules corresponding to the conclusions X F @ 
and X F 3. In (=2), X,a means X U {a}, and this abbreviated form will 
always be used when there is no risk of misunderstanding. 

Q1,---,Q@, / @ stands for {a1,...,@n,} + @; in particular, a - (@ for 
{a} 8, and a for 0+ a, just as with F. 


X - a (read from “X is provable or derivable a”) is to mean that the 


sequent (X,qa@) can be obtained after a stepwise application of the basic 
rules. We can make this idea of “stepwise application” of the basic rules 
rigorous and formally precise (intelligible to a computer, so to speak) in 
the following way: a derivation is to mean a finite sequence (So;...; Sin) 
of sequents such that every S; is either an initial sequent or is obtained 
through the application of some basic rule to preceding elements in the 
sequence. Thus, from X is derivable a if there is a derivation (So;...; Sin) 
with S;, = (X,a). A simple example with the end sequent a, 3 anQ, 
or minutely ({a, 3},a 3), is the derivation 
(ata;a,Br-a;BFB;a,BFB;a,BF ang). 

Here (MR) was applied twice, followed by an application of (a1). Not 
shorter would be complete derivation of the sequent (0, T), i-e., a proof of 
+ T. In this example both (71) and (-2) are essentially involved. 

Useful for shortening lengthy derivations is the derivation of additional 
rules, which will be illustrated with the examples to follow. The second 
example, a generalization of the first, is the often-used proof method re- 
ductio ad absurdum: a is proved from X by showing that the assumption 
ma leads to a contradiction. The other examples are given with respect 
to the defined —-connective. Hence, for instance, the —-elimination 


XEna = 
mentioned below runs in the original language eat) 
X,ak 8 
Examples of derivable rules 
X,7ab 
—— proof applied 


1 X,akFa_ (IS), (MR) 
2 X,7a@' a _ supposition 
3 XFa (72) 


(—-elimination) 
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lage proof applied 
(reductio ad apsedue) 1 X,7at 8,76 supposition 
2 X,7abra (=1) 
3 Xa —-elimination 
XFa-p 
X,ak B 
( +-elimination) tl xX,6, 48 Fag (IS), (MR) 
2 X,a,7aBl aanp (a1) 
3 XE A7A(aa78) supposition 
4 X,a,7AB' =(an7B) (MR) 
5 X,a,nabk B (-1) on 2 and 4 
6 X,ak GB —-elimination 
Xta| X,at B 
XFB 
(cut rule) 1 X,7-aka supposition, (MR) 
2 X,7ak 7a (IS), (MR) 
a: Xorg (=1) 
4 X,at Pp supposition 
5 XFB (2) on 4 and 3 
X,at B 
XFa-6B 
( — -introduction) 1 X,an7AB,at GB supposition, (MR) 
2 X,avn7nBblea (IS), (MR), (a2) 
3. X,aanBk B cut rule 
4 X,aa7nBl 7G (IS), (MR), (a2) 
5 X,aanBra-ZG (-1) 
6 X,7(av7B)F/ af (IS), (MR) 
7 XFa>6 (=2) on 5 and 6 


Remark 1. The example of — -introduction is nothing other than the syntactic 
form of the deduction theorem that was semantically formulated in the previous 
section. The deduction theorem also holds for intuitionistic logic. However, it is 
not in general true for all logical systems dealing with implication, thus indicating 
that the deduction theorem is not an inherent property of every meaningful 
conception of implication. For instance, the deduction theorem does not hold 
for certain formal systems of relevance logic that attempt to model implication 
as a cause-and-effect relation. 
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A simple application of the —-elimination and the cut rule is a proof 
of the detachment rule 


XFaa-> 
XI BG 
Indeed, the premise X | a —@ yields X,a+ § by —-elimination, and 
since X F a, it follows X + @ by the cut rule. Applying detachment on 
X = {a,a >}, we obtain a,a—> Ft GZ. This collection of sequents is 
known as modus ponens. It will be more closely considered in 1.6. 


Many properties of F are proved through rule induction, which we de- 
scribe after introducing some convenient terminology. We identify a prop- 
erty € of sequents with the set of all pairs (X,a) to which € applies. In 
this sense the logical consequence relation F is the property that applies 
to all pairs (X,a) with X Fa. 


All the rules considered here are of the form 


Xi bay | Sages | Mpa 


ft XFa 


and are referred to as Gentzen-style rules. We say that € is closed under 
R when €(X1,a1),...,E(Xn, Qn) implies E(X,a). For a rule without 
premises, i.e., n = 0, this is just to mean €(X,qa). For instance, consider 
the above already mentioned property € : X F a. This property is closed 
under each basic rule of . In detail this means 


aFa, XFa=>X'Fafor X'DX, XFa,BsoXFanQB, etc. 


From the latter we may conclude that € applies to all provable sequents; 


in other words, + is (semantically) sound. What we need here to verify 
this conclusion is the following easily justifiable 


Principle of rule induction. Let € (C $F x F) be a property closed 
under all basic rules of +. Then X + a implies E(X, a). 


Proof by induction on the length of a derivation of S = (X,a). If 
the length is 1, ES holds since S must be an initial sequent. Now let 
(So;.--;Sn) be a derivation of the sequent S := S,. By the induction 
hypothesis we have €S; for alli < n. If S is an initial sequent then €S 
holds by assumption. Otherwise S has been obtained by the application 
of a basic rule on some of the S$; for i <n. But then €S' holds, because € 
is closed under all basic rules. [J 
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As already remarked, the property X F a is closed under all basic 
rules. Therefore, the principle of rule induction immediately yields the 


soundness of the calculus, that is, F C F. More explicitly, 


XFa => XEa, for all X,a. 


There are several equivalent definitions of Ff. A purely set-theoretic 
one is the following: - is the smallest of all relations C PF x F that are 
closed under all basic rules. - is equally the smallest consequence relation 
closed under the rules (1) through (42). The equivalence proofs of such 
definitions are wordy but not particularly contentful. We therefore do not 
elaborate further, because we henceforth use only rule induction. Using 
rule induction one can also prove X F a => X° F a’, and in particular 
the following theorem, for which the soundness of | is irrelevant. 


Theorem 4.1 (Finiteness theorem for +). Jf X + a then there is a 
finite subset Xp C X with Xora. 


Proof. Let €(X,a) be the property ‘Xo | a for some finite X9 C X’. 
We will show that € is closed under all basic rules. Certainly, €(X, a) 
holds for X = {a}, with Xo = X so that € is closed under (IS). If X has 
a finite subset Xo such that Xp + a, then so too does every set X’ such 
that X' D> X. Hence € is closed under (MR). Let E(X, a), E(X, 8), with, 
say, X1 a, Xo @ for finite X1, X2 C X. Then we also have Xo k a, 3 
for Xo = X1,U X2 by (MR). Hence Xp Fk anZ by (a1). Thus E(X, an) 
holds, and € is closed under (a1). Analogously one shows the same for 
all remaining basic rules of + so that rule induction can be applied. [J 


Of great significance is the notion of formal consistency. It fully deter- 
mines the derivability relation, as the lemma to come shows. It will turn 
out that consistent formalizes adequately the notion satisfiable. 


Definition. X C F is called inconsistent (in our calculus +) if X F a for 
all a € F, and otherwise consistent. X is called maximally consistent if 
X is consistent but each Y > X is inconsistent. 


The inconsistency of X can be identified by the derivability of a single 
formula, namely 1 (= piA—p1), because X - 1 implies X + pi,—7pi by 
(a2), hence X F a for all a by (7-1). Conversely, when X is inconsistent 
then in particular X + 1. Thus, X + 1 may be read as ‘X is inconsistent’, 
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and X ¥ 1 as ‘X is consistent’. From this it easily follows that X is 
maximally consistent iff either a € X or —a € X for each a. The latter 
is necessary, for a, na ¢ X implies X,a@+ 1 and X,7a@+ 1, hence X F 1 
by (7-2), contradicting the consistency of X. Sufficiency is obvious. Most 
important is the following lemma. It confirms that derivability is reducible 
to consistency and the reader should have it down pat. 


Lemma 4.2. The derivability relation + has the properties 
Cr: XkFas X,-7-aF 1, Cc: XFraa S&S X,aF i. 


Proof. Suppose that X + a. Then clearly X,7a+ a@ and since certainly 
X,7a+t 7a, we have X,7a + £ for all G by (71), in particular X,7aF 1. 
Conversely, let X,7a@ + 1 be the case, so that in particular X,-a@ F a, 


and thus X - a@ by —-elimination on page 23. Property C~ is proved 
completely analogously. [J 


The claim F CF, not yet proved, is equivalent to X Ka > X Fa, 
for all X and a. But so formulated it becomes apparent what needs to 
be done to obtain the proof. Since X ¥ a is by CT equivalent to the 
consistency of X’ := X U {7a}, and X F a to the satisfiability of X’, we 
need only show that consistent sets are satisfiable. To this end we state 
the following lemma, whose proof, exceptionally, jumps ahead of matters 
in that it uses Zorn’s lemma from 2.1 (page 46). 


Lemma 4.3 (Lindenbaum’s theorem). Every consistent set X C F¥ 
can be extended to a maximally consistent set X' DX. 


Proof. Let H be the set of all consistent Y D X, partially ordered with 
respect to C. H 4 0), because X € H. Let K C A be a chain, i., 
Y C Z or Z CY, for all Y,Z € K. Claim: U := UK is an upper bound 
for kK. Since Y € K => Y CU, we have to show that U is consistent. 
Assume that UF 1. Then Up + for some finite Up = {ao,...,an} CU. 
If, say, a; € Y; € K, and Y is the biggest of the sets Yo,..., Yn, then 
a; € Y for all i <n, hence also Y + 1 by (MR). This contradicts Y € H 
and confirms the claim. By Zorn’s lemma, H has a maximal element X’, 
which is necessarily a maximally consistent extension of X. [J 


Remark 2. The advantage of this proof is that it is free of assumptions regarding 
the cardinality of the language, while Lindenbaum’s original construction deals 
with countable languages F only and does not require Zorn’s lemma, which is 
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equivalent to the axiom of choice. Lindenbaum’s argument runs as follows: Let 
Xo := X C F be consistent and let ao,a1,... be an enumeration of F. Put 
Xnii = Xn U {ay} if this set is consistent and X,4, = X, otherwise. Then 


Y = Unew Xn is a maximally consistent extension of X, as is easily verified. 


Lemma 4.3 can also be shown very similar to this approach, using an ordinal 
enumeration of X instead of Zorn’s Lemma. 


Lemma 4.4. A maximally consistent set X C F has the property 

[>] XFnaa & X Fa, for arbitrary a. 
Proof. If X / na, then X ' a cannot hold due to the consistency of X. 
If, on the other hand, X ¥ a, then X,-7a@ is a consistent extension of X 


according by Ct. But then sa € X, because X is maximally consistent. 
Consequently X ma. [J 


Lemma 4.5. A maximally consistent set X is satisfiable. 


Proof. Define w by wEp = X Fp. We will show that for all a, 
(x) XFa & wka. 


For prime formulas this is trivial. Further, 


XtaaB @& Xa, (rules (al), (A2) ) 
= wFa,f (induction hypothesis) 
= wkEaag (definition) 

xX F 7a ~~ XFa (Lemma 4.4) 
= wka (induction hypothesis) 
= wkEr7a (definition). 


By (x), w is a model for X, thereby completing the proof. [J 


Only the properties [A] X F}aanB@ Xa, and [-] from Lemma 4.4 
are used in the simple model construction in Lemma 4.5, which reveals 
the requirements for propositional model construction in the base {A , 7}. 
Since maximally consistent sets X are deductively closed (Exercise 5 
in 1.3), these requirements may also be stated as 

(A)aaBpEexX ea BEX ; (A) paEXSag€gX. 

Lemma 4.3 and Lemma 4.5 confirm the equivalence of the consistency 
and the satisfiability of a set of formulas. From this fact we easily obtain 
the main result of the present section. 
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Theorem 4.6 (Completeness theorem). X | a © X F a, for all 
formula sets X and formulas a. 


Proof. The direction = is the soundness of +. Conversely, X * a@ implies 
that X,—-a@ is consistent. Let Y be a maximally consistent extension of 
X,7qa according to Lemma 4.3. By Lemma 4.5, Y is satisfiable, hence 
also X,7a. Therefore X Fa. LJ 


An immediate consequence of Theorem 4.6 is the finiteness property 


(F) mentioned in 1.3, which is almost trivial for + but not for F: 


Theorem 4.7 (Finiteness theorem for F). [f X F a, then so too 
Xo Fa for some finite subset Xo of X. 


This is clear because the finiteness theorem holds for F (Theorem 4.1), 
hence also for F. A further highly interesting consequence of the com- 
pleteness theorem is 


Theorem 4.8 (Propositional compactness theorem). A set X of 
propositional formulas is satisfiable if each finite subset of X is satisfiable. 


This theorem holds because if X is unsatisfiable, i.e., if X F 1, then, by 
Theorem 4.7, we also know that Xo F 1 for some finite Xg9 C X, thus 


proving the claim indirectly. Conversely, one easily obtains Theorem 4.7 
from Theorem 4.8. Neither Theorem 4.6 nor Theorem 4.8 make any as- 
sumptions regarding the cardinality of the set of variables. This fact has 
many useful applications, as the next section will illustrate. 

Let us notice that there are direct proofs of Theorem 4.8 or appropri- 
ate reformulations that have nothing to do with a logical calculus. For 
example, the theorem is equivalent to 


Nacx Mda=9 > (ocx, Mda = 0 for some finite Xo C X, 


where Mda denotes the set of all models of a. In this formulation the 
compactness of a certain naturally arising topological space is claimed. 
The points of this space are the valuations of the variables, hence the 
name “compactness theorem.” More on this can be found in [RS]. 


Another approach to completeness (probably the simplest one) is pro- 
vided by Exercises 3 and 4. This approach makes some elegant use of 
substitutions, hence is called the completeness proof by the substitution 
method. This method is explained in the Solution Hints (and in more 
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detail in |[Ra3]). It yields the maximality of the derivability relation - 
(see Exercise 3), a much stronger result than its semantic completeness. 
This result yields not only the Theorems 4.6, 4.7, and 4.8 in one go, but 
also some further remarkable properties: Neither new tautologies nor new 
Hilbert style rules can consistently be adjoined to the calculus +. These 
properties (discussed in detail, e.g., in [Ral]) are known under the names 
Post completeness and structural completeness of , respectively. 


Exercises 


1. Prove using Theorem 4.7: if X U {7a |a € Y} is inconsistent and 
Y is nonempty, then there exist formulas ag,...,@, € Y such that 
XFagv +: Vn. 


2. Augment the signature {—, ~} by v and prove the completeness of 
the calculus obtained by supplementing the basic rules used so far 
with the rules 

Xa Aa | Ory 
(v1) : ; 


2, 
XkFavB,Bva ’ v2) X,av Bey 


3. Let | be a finitary consistent consequence relation in F{~,—} with 
the properties (A1) through (=2). Show that is mazimal (or 
maximally consistent). This means that each consequence relation 
HYD - in F{A,-} is inconsistent, i.e., +’ @ for all a. 


4. Show by referring to Exercise 3: there is exactly one (consistent) 
consequence relation in F{~,—-} satisfying (~1)—(-2). This clearly 
entails the completeness of F. 


1.5 Applications of the Compactness Theorem 


Theorem 4.8 is very useful in carrying over certain properties of finite 
structures to infinite ones. This section presents some typical examples. 
While these could also be treated with the compactness theorem of first- 
order logic in 3.3, the examples demonstrate how the consistency of cer- 
tain sets of first-order sentences can also be obtained in propositional logic. 
This approach to consistency is also useful also for Herbrand’s theorem 
and related results concerning logic programming. 
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1. Every set M can be (totally) ordered. ° 


This means that there is an irreflexive, transitive, and connex relation < 
on M. For finite M this follows easily by induction on the number of 
elements of M. The claim is obvious when M = @ or is a singleton. Let 
now M = N U {a} with an n-element set N and a ¢ N, so that M has 
n+ 1 elements. Then we clearly get an order on M from that for N by 
“setting a to the end,” that is, defining x < a for all x € N. 


Now let M be any set. We consider for every pair (a,b) € Mx Ma 

propositional variable p,y. Let X be the set consisting of the formulas 
“Paa (a eM ) 
Pab/ Poe ~ Pac (a, bce M), 
PabV Poa = (a #D). 

From w F X we obtain an order <, simply by puttinga<b © wE pgp. 
w F 7paa says the same thing as a < a. Analogously, the remaining 
formulas of X reflect transitivity and connexity. Thus, according to The- 
orem 4.8, it suffices to show that every finite subset Xo C X has a model. 
In Xo only finitely many variables occur. Hence, there are finite sets 
M, CM and X, D Xo, where Xj, is given exactly as X except that a, b,c 
now run through the finite set M, instead of M. But Xy is satisfiable, 
because if < orders the finite set M, and w is defined by w F pgp iff a < 8, 
then w is clearly a model for X 7, hence also for Xo. 


2. The four-color theorem for infinite planar graphs. 

A simple graph is a pair (V, £) with an irreflexive symmetrical relation 
E CV?*. The elements of V are called points or vertices. It is convenient 
to identify F with the set of all unordered pairs {a,b} such that afb and 
to call these pairs the edges of (V, F). If {a,b} € E then we say that a,b 
are neighbors. (V,F) is said to be k-colorable if V can be decomposed 
into k color classes C),...,Cy #9, V = Cy, U---UC,, with CNC; = 0 
for 7 # j, such that neighboring points do not carry the same color; in 
other words, if a,b € C; then {a,b} ¢ E fori =1,...,k. 


> Unexplained notions are defined in 2.1. Our first application is interesting because in 
set theory the compactness theorem is weaker than the axiom of choice (AC) which 
is equivalent to the statement that every set can be well-ordered. Thus, the ordering 
principle is weaker than AC since it follows from the compactness theorem. 


32 1 Propositional Logic 


The figure shows the smallest four-colorable graph 

that is not three-colorable; all its points neighbor 

each other. We will show that a graph (V,£) is 

k-colorable if every finite subgraph (Vo, Eo) is k- 

colorable. Ep consists of the edges {a,b} € E with 
a,b € Vo. To prove our claim consider the following set X of formulas 
built from the variables pa fora € V and 1 <i<k: 


Pal VY *** VY Pa,k> =| Dash Bog) (a S V7 1<i <j < k), 

“(Deg Aya) (fab) eB, t= 1,25; k). 
The first formula states that every point belongs to at least one color class; 
the second ensures their disjointedness, and the third that no neighboring 
points have the same color. Once again it is enough to construct some 
w KF X. Defining then the C; by a € C; @ w F pai proves that (V, E) is 
k-colorable. We must therefore satisfy each finite X»9 C X. Let (Vo, Eo) 
be the finite subgraph of (V, £) of all the points that occur as indices in 
the variables of Xo. The assumption on (Vo, Eo) obviously ensures the 
satisfiability of Xq for reasons analogous to those given in Example 1, and 
this is all we need to show. The four-color theorem says that every finite 
planar graph is four-colorable. Hence, the same holds for all graphs whose 
finite subgraphs are planar. These cover in particular all planar graphs 
embeddable in the real plane. 


3. KG6nig’s tree lemma. There are several versions of this lemma. For 
simplicity, ours refers to a directed tree. This is a pair (V,<J) with an 
irreflexive relation < C V? such that for a certain point c, the root of the 
tree, and any other point a there is precisely one path connecting c with 
a. This is a sequence (ai)i<n with a9 = C, Gn = a, and a; <I aj+1 for all 
i <n. From the uniqueness of a path connecting c with any other point 
it follows that each b 4 c has exactly one predecessor in (V,<)), that is, 
there is precisely one a with a <b. Hence the name tree. 


Konig’s lemma then reads as follows: If every a € V has only finitely 
many successors and V contains arbitrarily long finite paths, then there 
is an infinite path through V_ starting at c. By such a path we mean a 
sequence (cj);en such that co = c and cg <I ce41 for each k. In order 
to prove the lemma we define the “layer” S; inductively by So = {c} 
and S,41 = {b € V | there is some a € 5S; with a < b}. Since every point 
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has only finitely many successors, each S$, is finite, and since there are 
arbitrarily long paths c < aj <4 --- < az and ag € Sz, no Sz is empty. 
Now let pa for each a € V be a propositional variable, and let X consist 
of the formulas 

(A) Vaes, Par —=(ParDp) (a,b EC Sp, ax#b, kE N), 

(B) Db Da (a,beEV, a <b). 
Suppose that w — X. Then by the formulas under (A), for every k there 


is precisely one a € S, with w F pg, denoted by cg. In particular, co = c. 
Moreover, cz <1 cx41 for all k. Indeed, if a is the predecessor of b = cx 41, 
then w F pq in view of (B), hence necessarily a = cy. Thus, (c;)jen is a 
path of the type sought. Again, every finite subset Xo C X is satisfiable; 
for if Xo contains variables with indices up to at most the layer S,, then 
Xo is a subset of a finite set of formulas X, that is defined as X, except 
that & runs only up to n, and for this case the claim is obvious. 


4. The marriage problem (in linguistic guise). 

Let N 4 @ bea set of words or names (in speech) with meanings in a set 
M. Anamev € N can bea synonym (ie., it shares its meaning with other 
names in NV), or a homonym (i.e., it can have several meanings), or even 
both. We proceed from the plausible assumption that each name v has 
finitely many meanings only and that k names have at least k meanings. 
It is claimed that a pairing-off exists; that is, an injection f:N — M 
that associates to each y one of its original meanings. 

For finite NV, the claim will be proved by induction on the number n of 
elements of N. It is trivial for n = 1. Now let n > 1 and assume that the 
claim holds for all k-element sets of names whenever 0 < k < n. 

Case 1: For each k (0 < k <n): k& names in N have at least k + 1 
distinct meanings. Then to an arbitrarily chosen v from N, assign one of 
its meanings a to it so that from the names out of N\{v} any k names 
still have at least k meanings 4 a. By the induction hypothesis there is a 
pairing-off for N\{v} that together with the ordered pair (v,a) yields a 
pairing-off for the whole of N. 

Case 2: There is some k-element K C N (0 < k <n) such that the 
set Mr of meanings of the vy € K has only k members. Every v € K 
can be assigned its meaning from Mx by the induction hypothesis. From 
the names in N\ Kk any i names (i < n— k) still have 7 meanings not in 
Mx, as is not hard to see. By the induction hypothesis there is also a 
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pairing-off for N\ Kk with a set of values from M\Myx. Joining the two 
obviously results in a pairing-off for the whole of N. 

We will now prove the claim for arbitrary sets of names N: assign to 
each pair (v,a) € N x M a variable p,_ and consider the set of formulas 

x. { PuaVv«': VPue (VEN, a,...,e the meanings of v), 
UPuacrPvy) (VEN, 2, yeM, x#y). 

Assume that w F X. Then to each v there is exactly one a, with w F pya,, 
so that {(v,a,) |v € N} is a pairing-off for N. Such a model w exists by 
Theorem 4.8, for in a finite set X9 C X occur only finitely many names 


as indices and the case of finitely many names has just been treated. 


5. The ultrafilter theorem. 

This theorem is of fundamental significance in topology (from which it 
originally stems), model theory, set theory, and elsewhere. Let I be any 
nonempty set. A nonempty collection of sets F' C ¥T is called a filter on 
I if for all M,N CTI hold the conditions 


(a) MNEFS>MOAN EP, (b) MEF &MCNSNEF. 
Since F' 4 @, (b) shows that always I € F’. As is easily verified, (a) and 
(b) together are equivalent to just a single condition, namely to 


(n) MNNE€F & MeEFandNEF. 


For fixed K CI, {J CI| JD K} is a filter, the principal filter generated 
by K. This is a proper filter provided K 4 (), which in general is to mean 
a filter with @ ¢ F. Another example on an infinite I is the set of all 
cofinite subsets M C I, i.e., >M (= I\M) is finite. This holds because 
M1 Mz is cofinite iff MM, M2 are both cofinite, so that (n) is satisfied. 
A filter Fis said to be an ultrafilter on I provided it satisfies, in addition, 


(7) IMeFsoMéF. 


Ultrafilters on an infinite set J containing all cofinite subsets are called 
nontrivial. That such ultrafilters exist will be shown below. It is nearly 
impossible to describe them more closely. Roughly speaking, “we know 
they exist but we cannot see them.” A trivial ultrafilter on J contains at 
least one finite subset. {J C I | %9 € J} is an example for each ig € J. 
This is a principal ultrafilter. All trivial ultrafilters are of this form, 
Exercise 3. Thus, trivial and principal ultrafilters coincide. In particular, 
each ultrafilter on a finite set J is trivial in this sense. 
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Each proper filter F’ obviously satisfies the assumption of the following 
theorem and can thereby be extended to an ultrafilter. 


Theorem 5.1 (Ultrafilter theorem). Every subset F C PI can be 
extended to an ultrafilter U on a set I, provided Mo QN---AM, 49 for all 
n and all Mo,...,Mn € F. 
Proof. Consider along with the propositional variables p, for J C I 

X Pun Pu Pwr Pam Pur Py (MN CI, JE F). 
Let w — X. Then (n),(-) are valid for U := {J CI |wF p,}; hence U 
is an ultrafilter such that F' C U. It therefore suffices to show that every 


finite subset of X has a model, for which it is in turn enough to prove the 
ultrafilter theorem for finite F'’. But this is easy: Let F = {Mo,..., Mn}, 
D := Mon---M Mp, and igo € D. Then U = {J CI | ig € J} is an 
ultrafilter containing F. [J 


Exercises 


1. Prove (using the compactness theorem) that every partial order <o 
on aset M can be extended to a total order < on M. 


2. Let F' be a proper filter on I (4 0). Show that F is an ultrafilter iff 
it satisfies (u): MUNEF © Me ForNeEF. 


3. Let J be an infinite set. Show that an ultrafilter U on J is trivial iff 
there is an ig € J such that U={J CT|lio € J}. 


1.6 Hilbert Calculi 


In a certain sense the simplest logical calculi are so-called Hilbert calculi. 
They are based on tautologies selected to play the role of logical axioms; 
this selection is, however, rather arbitrary and depends considerably on 
the logical signature. They use rules of inference such as, for example, 
modus ponens MP: a,a >/3.° An advantage of these calculi consists 


° Putting it crudely, this notation should express the fact that 3 is held to be proved 
from a formula set X when a and a > £ are provable from X. Modus ponens is an 
example of a binary Hilbert-style rule; for a general definition of this type of rule see, 
for instance, [Ral]. 
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in the fact that formal proofs, defined below as certain finite sequences, 
are immediately rendered intuitive. This advantage will pay off above all 
in the arithmetization of proofs in 6.2. 

In the following we consider such a calculus with MP as the only rule 
of inference; we denote this calculus for the time being by fb, in order to 
distinguish it from the calculus | of 1.4. The logical signature contains 
just — and a, the same as for +. In the axioms of }, however, we will 
also use implication defined by a > := =(a@a-{), thus considerably 
shortening the writing down of the axioms. 

The logical axiom scheme of our calculus consists of the set A of all 
formulas of the following form (not forgetting the right association of 
parentheses in Al, A2, and A4): 


Al (a+8-+7) +(a-8) >a, A2 a+f-anB, 

A383 anaBra, anB—B, A4. (a >78) +8 7a. 
A consists only of tautologies. Moreover, all formulas derivable from A 
using MP are tautologies as well, because F a,a — 3 implies F 3. We will 


show that all 2-valued tautologies are provable from A by means of MP. 
To this aim we first define the notion of a proof from X C F in bk. 


Definition. A proof from X (in ) is a sequence ® = (yo,..., Yn) such 
that for every k < n either y, € X UA or there exist indices 1,7 < k 
such that yp; = Yi > yp (ie., Y, results from applying MP to terms of ® 
preceding yz). A proof (Yo,.--,;Yn) with yp, = a is called a proof of a 
from X of length n+1. Whenever such a proof exists we write X a and 
say that a is provable or derivable from X. 


Example. (p,q,p ~¢ ~PAQG,¢d ~PAQ,PAQ) is a proof of pag from the 
set X = {p,q}. The last two terms in the proof sequence derive with MP 
from the previous ones, which all are members of X UA. 


Since a proof contains only finitely many formulas, the preceding def- 
inition leads immediately to the finiteness theorem for }, formulated 
correspondingly to Theorem 4.1. Every proper initial segment of a proof 
is obviously a proof itself. Moreover, concatenating proofs of a and a > 3 
and tacking on ( to the resulting sequence will produce a proof for (, as 
is plain to see. This observation implies 


(x) Xha,a-B => XB. 
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In short, the set of all formulas derivable from X is closed under MP. In 
applying the property (*) we will often say “MP yields ...” It is easily 
seen that Xba iff a belongs to the smallest set containing X U A and 
closed under MP. For the arithmetization of proofs and for automated 
theorem proving, however, it is more appropriate to base derivability on 
the notion of a proof given in the last definition, because of its finitary 
character. Fortunately, the following theorem relieves us of the necessity 
to verify a property of formulas a derivable from a given formula set X 
each time by induction on the length of a proof of a from X. 


Theorem 6.1 (Induction principle for ). Let X be given and let € 
be a property of formulas. Then E holds for all a with X ha, provided 
(0) € holds for allac XUA, 
(s) Ea and E(a +8) imply EB, for all a, (. 


Proof by induction on the length n of a proof ® of a from X. Ifa €¢ XUA 
then Ea holds by (0), which applies in particular ifn =1. fag XUA 
then n > 1 and ® contains members a; and a; = a; >a both having 
proofs of length < n. Hence, it holds €a; and Ea; by the induction 
hypothesis, and so Ea according to (s). LJ 


An application of Theorem 6.1 is the proof of h CF, or more explicitly, 
Xtra > XFa (soundness). 
To see this let Ea be the property ‘X F a’ for fixed X. Certainly, X F a 
holds for a € X. The same is true for a € A. Thus, Ea for alla € X UA, 
and (0) is confirmed. Now let X F a,a—(; then so too X F £3, thus 
confirming the inductive step (s) in Theorem 6.1. Consequently, Ea (that 
is, X F a) holds for all a with X ka. 


Unlike the proof of completeness for +, the one for k requires a whole 


series of derivations to be undertaken. This is in accordance with the 
nature of things. To get Hilbert calculi up and running one must often 
begin with drawn-out derivations. In the derivations below we shall use 
without further comment the monotonicity (M) (page 19, with - for F). 
(M) is obvious, for a proof in } from X is also a proof from X’ D X. 
Moreover, + is a consequence relation (as is every Hilbert calculus, based 
on Hilbert style rules). For example, if X KY ka, we construct a proof of 
a from X by replacing each y € Y occurring in a proof of a from Y by a 
proof of y from X. This confirms the transitivity (T). 
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Lemma 6.2. (a) XkKa>7A6 > XhG->-7a, (b) Ka-GB-a, 

(c) Kaa, (d) Ka---7a, (e) KB -7AB a. 
Proof. (a): Clearly X h(a +7) +8 >+-7a by Axiom A4. From this and 
from X ha —-( the claim is derived by MP. (b): By A383, KBA7a > 7a, 
and so with (a), ka ~7(Gra-a) =a-P a. 
(c): From 7 :=a, 6 :=a—a in Al we obtain 

K(a+(a >a) >a) +(a>-a>a) >a >a, 

which yields the claim by applying (b) and MP twice; (d) then follows 
from (a) using K7a —-7a. (e): Due to KaGAnra —>-6 and (a), we get 
Pear) af oapaa, Ll 


Clearly, } satisfies the rules (A1) and (A2) of 1.4, in view of A2, A3. 
Part (e) of Lemma 6.2 yields X 3,73 > Xba, so that } satisfies also 
rule (=1). After some preparation we will show that rule (=2) holds for 


t as well, thereby obtaining the desired completeness result. A crucial 
step in this direction is 

Lemma 6.3 (Deduction theorem). X,ay implies X hay. 
Proof by induction in ' with a given set X,a. Let X,aby, and let Ey 
now mean ‘X /}a —7’. To prove (0) in Theorem 6.1, let y ¢ AUX U{a}. 
If y = a then clearly X ka +7 by Lemma 6.2(c). If y € X UA then cer- 
tainly X}y. Because also X hy >a —y by Lemma 6.2(b), MP yields 
X ha —y, thus proving (0). To show (s) let X,a’@ and X,ak8—-y7, 
so that X ka > 8,a —£ 7 by the induction hypothesis. Applying MP 
to Al twice yields X ha 7, thus confirming (s). Therefore, by Theo- 
rem 6.1, €y for all y~, which completes the proof. [J 


Lemma 6.4. 77a >a. 


Proof. By A3 and MP, ~7=a,7ah-7a,77a. Choose any 7 with 7. 
The already verified rule (41) clearly yields ->~a,7a@)/-7, and in view 


of Lemma 6.3, Ka7aa@A7a—>-77. From Lemma 6.2(a) it follows that 
hr +-7(47aA7a@). But 7, hence using MP we obtain bh 7=(-=7a 7a) 
and the latter formula is just >7a >a. [J 


Lemma 6.3 and Lemma 6.4 are preparations for the next lemma, which 
is decisive in proving the completeness of +. 
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Lemma 6.5. + satisfies also rule (2) of the calculus F. 


Proof. Let X,Gha and X,78ha; then X,Gh—77a and X,7AGBh-77a 
by Lemma 6.2(d). Hence, XG —--7a,76 +--7a (Lemma 6.3), and 
so X bra +78 and X Kra +76 by Lemma 6.2(a). Thus, MP yields 
X,7abh768,778, whence X,7ah-7 by (71), with 7 as in Lemma 6.4. 
Therefore X /7>a@ —-77, due to Lemma 6.3, and hence X hT + 77a by 
Lemma 6.2(a). Since X/7 it follows that X/—-7a@ and so eventually 
Xba by Lemma 6.4. LJ 


Theorem 6.6 (Completeness theorem). / = F. 


Proof. Clearly, h C F. Now, by what was said already on page 38 and 
by the lemma above, | satisfies all basic rules of F. Therefore, /C bk. 
Since k = F (Theorem 4.6), we obtain alsoF C F. [LJ 


This theorem implies in particular hy =F y. In short, using MP one 
obtains from the axiom system A exactly the two-valued tautologies. 


Remark 1. It may be something of a surprise that Al—A4 are sufficient to obtain 
all propositional tautologies, because these axioms and all formulas derivable 
from them using MP are collectively valid in intuitionistic and minimal logic. 
That A permits the derivation of all two-valued tautologies is based on the fact 
that — was defined. Had — been considered as a primitive connective, this 
would no longer have been the case. To see this, alter the interpretation of 7 
by setting -0 = —=1 = 1. While one here indeed obtains the value 1 for every 
valuation of the axioms of A and formulas derived from them using MP, one does 
not do so for =7>p > p, which therefore cannot be derived. Modifying the two- 
valued matrix or using many-valued logical matrices is a widely applied method 
to obtain independence results for logical axioms. 


Thus, we have seen that there are very different calculi for deriving 


tautologies or to recover other properties of the semantic relation F. We 
have studied here to some extend Gentzen-style and Hilbert-style calculi 
and this will be done also for first-order logic in Chapter 2. In any case, 
logical calculi and their completeness proofs depend essentially on the 
logical signature, as can be seen, for example, from Exercise 1. 


Besides Gentzen- and Hilbert-style calculi there are still other types 
of logical calculi, for example various tableau calculi, which are above all 
significant for their generalizations to nonclassical logical systems. Related 
to tableau calculi is the resolution calculus dealt with in 4.3. 
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Using Hilbert-style calculi one can axiomatize 2-valued logic in other 
logical signatures and functional incomplete fragments. For instance, the 
fragment in ~,v, which, while having no tautologies, contains a lot of 
interesting Hilbert-style rules. Proving that this fragment is axiomatizable 
by finitely many such rules is less easy as might be expected. At least nine 
Hilbert rules are required. Easier is the axiomatization of the well-known 

—-fragment in Exercise 3, less easy that of the v-fragment in Exercise 4. 
Each of the infinitely many fragments of two-valued logic with or without 
tautologies is axiomatizable by a calculus using only finitely many Hilbert- 
style rules of its respective language, as was shown in |HeR]. 

Remark 2. The calculus in Exercise 4 that treats the fragment in v alone, is 
based solely on unary rules. This fact considerably simplifies the matter, but 


the completeness proof is nevertheless nontrivial. For instance, the indispensable 
tule (a3)y/a(G7) is derivable in this calculus, since a tricky application of the 


rules (3) and (4) yields (a8)y F (a8) F (ya)8 F Biya) F (By)a F a(Gy). 
Much easier would be a completeness proof of this fragment with respect to the 
Gentzen-style rules (v1) and (v2) from Exercise 2 in 1.4. 


Exercises 


1. Prove the completeness of the Hilbert calculus in F{—, 1} with 
MP as the sole rule of inference, the definition ~a := a@ — 1, and 
the axioms Al: a>—-a, A2: (a>B—-7) -(a >) -a->4, 
and A3: =7a >a. 


2. Let | be a finitary consequence relation and let X ¥ y. Use Zorn’s 
lemma to prove that there is a y-maximal Y D X, that is, Y K y 
but Y,a+ y whenever a ¢ Y. Such a Y is deductively closed but 
need not be maximally consistent. 


3. Let + denote the calculus in ¥{ +} with the rule of inference MP, the 
axioms Al, A2 from Exercise 1, and ((a - 3) ~a) —a (the Peirce 
axiom). Verify that (a) a y-maximal set X is maximally consistent, 
(b) F is a complete calculus in the propositional language F{—}. 


4. Show the completeness of the calculus + in F¥{v} with the four unary 
Hilbert-style rules below. The writing of v has been omitted: 


(1)a/aB, (2)aa/a, (3) aB8/Ba, (4) a(By)/(a8)7. 


Chapter 2 
First-Order Logic 


Mathematics and some other disciplines such as computer science often 
consider domains of individuals in which certain relations and operations 
are singled out. When using the language of propositional logic, our abil- 
ity to talk about the properties of such relations and operations is very 
limited. Thus, it is necessary to refine our linguistic means of expres- 
sion, in order to procure new possibilities of description. To this end, one 
needs not only logical symbols but also variables for the individuals of the 
domain being considered, as well as a symbol for equality and symbols 
for the relations and operations in question. First-order logic, sometimes 
called also predicate logic, is the part of logic that subjects properties of 
such relations and operations to logical analysis. 


Linguistic particles such as “for all” and “there exists” (called quantifiers) 
play a central role here, whose analysis should be based on a well prepared 
semantic background. Hence, we first consider mathematical structures 
and classes of structures. Some of these are relevant both to logic (in 
particular model theory) and to computer science. Neither the newcomer 
nor the advanced student needs to read all of 2.1, with its mathemati- 
cal flavor, at once. The first five pages should suffice. The reader may 
continue with 2.2 and later return to what is needed. 


Next we home in on the most important class of formal languages, 
the first-order languages, also called elementary languages. Their main 
characteristic is a restriction of the quantification possibilities. We discuss 
in detail the semantics of these languages and arrive at a notion of logical 
consequence from arbitrary premises. In this context, the notion of a 
formalized theory is made more precise. 


Al 
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Finally, we treat the introduction of new notions by explicit definitions 
and other expansions of a language, for instance by Skolem functions. 
Not until Chapter 3 do we talk about methods of formal logical deduc- 
tion. While a multitude of technical details have to be considered in this 
chapter, nothing is especially profound. Anyway, most of it is important 
for the undertakings of the subsequent chapters. 


2.1 Mathematical Structures 


By a structure A we understand a nonempty set A together with certain 
distinguished relations and operations of A, as well as certain constants 
distinguished therein. The set A is also termed the domain of A, or its 
universe. The distinguished relations, operations, and constants are called 
the (basic) relations, operations, and constants of A. A finite structure 
is one with a finite domain. An easy example is ({0,1},~, v, 7). Here 
A, V, — have their usual meanings on the domain {0,1}, and no distin- 
guished relations or constants occur. An infinite structure has an infinite 
domain. A = (N,<,+,-,0,1) is an example with the domain N; here 
<, +, -, 0, 1 have again their ordinary meaning. 

Without having to say so every time, for a structure A the correspond- 
ing letter A will always denote the domain of A; similarly B denotes the 
domain of B, etc. If A contains no operations or constants, then A is also 
called a relational structure. If A has no relations it is termed an algebraic 
structure, or simply an algebra. For example, (Z, <) is a relational struc- 
ture, whereas (Z, +,0) is an algebraic structure, the additive group Z (it is 
customary to use here the symbol Z as well). Also the set of propositional 
formulas from 1.1 can be understood as an algebra, equipped with the 
operations (a, 3) + (aa), (a, 8) (av B), and a+ 7a. Thus, one 
may speak of the formula algebra F whenever it is useful to do so. 

Despite our interest in specific structures, whole classes of structures 
are also often considered, for instance the classes of groups, rings, fields, 
vector spaces, Boolean algebras, and so on. Even when initially just a 
single structure is viewed, call it the paradigm structure, one often needs 
to talk about similar structures in the same breath, in one language, so 
to speak. This can be achieved by setting aside the concrete meaning of 
the relation and operation symbols in the paradigm structure and consid- 
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ering the symbols in themselves, creating thereby a formal language that 
enables one to talk at once about all structures relevant to a topic. Thus, 
one distinguishes in this context clearly between denotation and what is 
denoted. To emphasize this distinction, for instance for A = (A,+,<,0), 
it is better to write A = (A, +4, <4,04), where +4, <4, and 04 mean 
the relation, operation, and constant denoted by +, <, and 0 in A. Only 
if it is clear from the context what these symbols denote may the super- 
scripts be omitted. In this way we are free to talk on the one hand about 
the structure A, and on the other hand about the symbols +, <, 0. 

A finite or infinite set D resulting in this way, consisting of relation, 
operation, and constant symbols of a given arity, is called an extralogical 
signature. For the class of all groups (see page 47), L = {0,e} exemplifies 
a favored signature; that is, one often considers groups as structures of 
the form (G,°o,e), where o denotes the group operation and e the unit 
element. But one can also define groups as structures of the signature 
{co}, because e is definable in terms of o, as we shall see later. Of course, 
instead of o, another operation symbol could be chosen such as -, *, or +. 
The latter is mainly used in connection with commutative groups. In this 
sense, the actual appearance of a symbol is less important; what matters 
is its arity. r € DL always means that r is a relation symbol, and f € L 
that f is an operation symbol, each time of some arity n > 0, which of 
course depends on the symbols r and f, respectively. ! 

An L-structure is a pair A = (A, L“), where L+ contains for every r € L 
a relation r4 on A of the same arity as r, for every f € L an operation 
f4 on A of the arity of f, and for every c € L a constant c4 € A. We 
may omit the superscripts, provided it is clear from the context which 
operation or relation on A is meant. We occasionally shorten also the 
notation of structures. For instance, we sometimes speak of the ring Z or 
the field R provided there is no danger of misunderstanding. 


Every structure is an L-structure for a certain signature, namely that 
consisting of the symbols for its relations, functions, and constants. But 
this does not make the name L-structure superfluous. Basic concepts, such 


‘Here r and f represent the general case and look different in a concrete situation. 
Relation symbols are also called predicate symbols, in particular in the unary case, 
and operation symbols are sometimes called function symbols. In special contexts, 
we also admit n = 0, regarding constants as 0-ary operations. 
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as isomorphism and substructure, refer to structures of the same signature. 
From 2.2 on, once the notion of a first-order language of signature DL has 
been defined, we mostly write £-structure instead of L-structure. We will 
then also often say that r, f, or c belongs to £ instead of L. 

If A C B and f is an n-ary operation on B then A is closed under f, 
briefly f-closed, if fa € A for alld € A”. If n = 0, that is, if f isa 
constant c, this simply means c € A. The intersection of any nonempty 
family of f-closed subsets of B is itself f-closed. Accordingly, we can talk 
of the smallest (the intersection) of all f-closed subsets of B that contain 
a given subset & C B. All of this extends in a natural way if f is here 
replaced by an arbitrary family of operations of B. 


Example. For a given positive m, the set mZ := {m-n|n © Z} of 
integers divisible by m is closed in Z under +, —, and -, and is in fact the 
smallest such subset of Z containing m. 


The restriction of an n-ary relation r? C B" to a subset A C B is 
r4 = r? 4 A”. For instance, the restriction of the standard order of 
R to N is the standard order of N. Only because of this fact can the 
same symbol be used to denote these relations. The restriction f4 of an 
operation f? on B to a set A C B is defined analogously whenever A is 
f-closed. Simply let f4@ = f?a for @ € A”. For instance, addition in N 
is the restriction of addition in Z to N, or addition in Z is an extension of 
this operation in N. Again, only this state of affairs allows us to denote 
the two operations by the same symbol. 

Let 6 be an L-structure and let A C B be nonempty and closed under 
all operations of B; this will be taken to include @ € A for constant 
symbols c € L. To such a subset A corresponds in a natural way an L- 
structure A = (A, L“), where r4 and f“ for r, f € L are the restrictions 
of r8 respectively f8 to A. Finally, let c4 = c8 for c € L. The structure A 
so defined is then called a substructure of B, and BG is called an extension 
of A, in symbols A C B. This is a certain abuse of C but it does not 
cause confusion, since the arguments indicate what is meant. 

A C B implies A C B but not conversely, in general. For example, 
A = (N,<,+,0) is a substructure of B = (Z,<,+,0) since N is closed 
under addition in Z and 0 has the same meaning in A and B. Here 
we dropped the superscripts for <, +, and 0 because there is no risk of 
misunderstanding. 
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A nonempty subset G of the domain B of a given L-structure B defines 
a smallest substructure A of 6 containing G. The domain of A is the 
smallest subset of B containing G and closed under all operations of B. 
A is called the substructure generated from G in B. For instance, 3N 
(= {3n|n € N}) is the domain of the substructure generated from {3} in 
(N,+,0), since 3N contains 0 and 3, is closed under +, and is clearly the 
smallest such subset of N. A structure A is called finitely generated if for 
some finite G C A the substructure generated from G in A coincides with 
A. For instance, (Z,+,—,0) is finitely generated by G = {1}. 


If Ais an L-structure and Lo C L then the Lo-structure Ag with domain 


A and where s“° = s 


A for all symbols s € Lo is termed the Lo-reduct of 
A, and A is called an L-expansion of Ag. For instance, the group (Z, +, 0) 
is the {+,0}-reduct of the ordered ring (Z, <,+,-,0). The notions reduct 
and substructure must clearly be distinguished. A reduct of A has always 
the same domain as A, while the domain of a substructure of A is as a 


rule a proper subset of A. 


Below we list some frequently cited properties of a binary relation < 
in a set A. It is convenient to write a <b instead of (a,b) €<, anda fb 
for (a,b) €<. Just as a < b < c often stands for a < b & b < c, we 
writea db<dcforadb&b<c. In the listing below, ‘for all a’ and 
‘there exists an a’ respectively mean ‘for all a € A’ and ‘there exists some 
a € A’. The relation < C A? is called 


reflexive if a<a for alla, 

irreflexive if a fa for all a, 

symmetric if adb => bda, for all a,b, 
antisymmetric if adbda => a=b, for all a,b, 
transitive if adbdc => acc, for all a,b,c, 
connex if a=bora<bor b<a, for all a,b. 


Reflexive, transitive, and symmetric relations are also called equivalence 
relations. These are often denoted by ~, ~, =, ~, or similar symbols. 
Such a relation generates a partition of its domain whose parts, consisting 
of mutually equivalent elements, are called equivalence classes. 

We now present an overview of classes of structures to which we will 
later refer, mainly in Chapter 5. Hence, for the time being, the beginner 
may skip the following and jump to 2.2. 
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1. Graphs, partial orders, and orders. A relational structure (A, <j) 
with < C A? is often termed a (directed) graph. If <1 is irreflexive and 
transitive we usually write < for < and speak of a (strict) partial order or 
a partially ordered set, also called a poset for short. If we define x < y by 
x<yorxz=y, then < is reflexive, transitive, and antisymmetric, called 
a reflexive partial order of A, the one that belongs to <. Call (A,<) a 
preorder, if < is reflexive and transitive but not necessarily antisymmetric. 
Preorders are frequently met in real life, e.g., 7 is less or equally expensive 
as y. If x ~ y means x < y < & then ~ is an equivalence relation on A. 

A connex partial order A = (A,<) is called a total or linear order, 
briefly termed an order or an ordered or a strictly ordered set. N, Z, Q, 
R are examples with respect to their standard orders. Here we follow the 
tradition of referring to ordered sets by their domains only. 

Let U be a nonempty subset of some ordered set A such that for all 
abe A,a<beUSaecu. Such aU is called an initial segment of A. 
In addition, let V := A\U #0. Then the pair (U,V) is called a cut. The 
cut is said to be a gap if U has no largest and V no smallest element. 
However, if U has a largest element a, and V a smallest element b, then 
(U,V) is called a jump. In this case 6 is called the immediate successor 
of a, and a the immediate predecessor of b, because there is no element 
from A between a and b. An infinite ordered set without gaps and jumps 
like R, is said to be continuously ordered. Such a set is easily seen to be 
densely ordered, i.e., between any two elements lies another one. 

A totally ordered subset K of a partially ordered set H is called a chain 
in H. Such a K is said to be bounded (to the above) if there is some b € H 
with a < b for alla € K. Callc € H maximal in H if no a € A exists 
with a > c. An infinite partial order need not have maximal elements, 
nor need all chains be bounded, as is seen by the example (N,<). With 
these notions, a basic mathematical tool can now be stated: 

Zorn’s lemma. If every chain in a nonempty poset H is bounded then 
HT has a maximal element. 

A (totally) ordered set A is well-ordered if every nonempty subset of 
A has a smallest element; equivalently, there are no infinite decreasing 
sequences ag > a1 >--- of elements from A. Clearly, every finite ordered 
set is well-ordered. The simplest example of an infinite well-ordered set is 
N together with its standard order. 
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2. Groupoids, semigroups, and groups. Algebras A = (A,o) with 
an operation o: A? — A are termed groupoids. If is associative then A 
is called a semigroup, and if o is additionally invertible, then A is said 
to be a group. It is provable that a group (G,°) in this sense contains 
exactly one unit element, that is, an element e such that roe =eox=2 
for all x € G, also called a neutral element. A well-known example is the 
group of bijections of a set M. If the group operation © is commutative, 
we speak of a commutative or abelian group. 

Here are some examples of semigroups that are not groups: (a) the 
set of strings over some alphabet A with respect to concatenation, the 
word-semigroup or free semigroup generated from A. (b) the set MM 
of mappings from M to itself with respect to composition. (c) (N,+) 
and (N,-); these two are commutative semigroups. With the exception of 
(M™,o), all mentioned examples of semigroups are regular, which means 
voz=yozor zox = zoy implies x = y, for all x,y, z of the domain. 

Substructures of semigroups are again semigroups. Substructures of 
groups are in general only semigroups, as seen from (N,+) C (Z,+). Not 
so in the signature {o,e,~!}, where e denotes the unit element and a7! 
the inverse of x. Here all substructures are indeed subgroups. The reason 
is that in {o,e,~!}, the group axioms can be written as universally quan- 
tified equations, where for brevity, we omit the writing of “for all «x, y, z,” 


1 


namely as ro(yoz) =(xoy)oz, roe =2, rox =e. These equations 


certainly retain their validity in the transition to substructures, and they 


| — ¢ for all x, although o is 


imply eo = roe =x and x og = gon7 
not a commutative operation on its domain, in general. 

Ordered semigroups and groups possess along with o some order, with 
respect to which © is monotonic in both arguments, like (N,+,0,<). A 
commutative ordered semigroup (A,+,0,<) with neutral (or zero) ele- 
ment 0 that at the same time is the smallest element in A, and where 
x < y iff there is some z with «+ z= y, is called a domain of magnitude. 


Everyday examples are the domains of length, mass, money, etc. 


3. Rings and fields. These belong to the commonly known structures. 
Below we list the axioms for the theory Tr of fields in +,-,0,1. A fteld 
is a model of Tr. A ring is a model of the axiom system Tp for rings 
that derives from Tr by dropping the axioms N*, C*, and I*, and the 
constant 1 from the signature. Here are the axioms of Tr: 
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N': #2+0=¢2 I ae 
CT: gx+y=yt2 Oa 
AT: (a+y)+2=24+(y+z) A®: 
D: a-(ytz)=a-ytau-2 Do: (yt2z):t=y rte 
I”: Vadtyx+y=0 I: 041A (V2z¥40)4yx-y=1 


In view of C*, axiom D’ is dispensable for Tr but not for Tr. When 
removing It from Tr, we obtain the theory of semirings. A well-known 
example is (N,+,-,0). A commutative ring that has a unit element 1 


but no zero-divisor (i.e., ~Aady(x2,y #0 x -y=0) is called an integral 
domain. A typical example is (Z,+,-,0,1). 

Let K,K’ be any fields with K Cc K’. We call a € K'\K algebraic or 
transcendental on K, depending on whether a is a zero of a polynomial 


with coefficients in K or not. If every polynomial of degree > 1 with 
coefficients in K breaks down into linear factors, as is the case for the 
field of complex numbers, then XK is called algebraically closed, in short, K 
is a.c. These fields will be more closely inspected in 3.3 and Chapter 5. 
Each field K has a smallest subfield P, called a prime field. One says that 
K has characteristic 0 or p (a prime number), depending on whether P is 
isomorphic to the field Q or the finite field of p elements. No other prime 
fields exist. It is not hard to show that K has the characteristic p iff the 
sentence char, : 1+---+1=0 holds in K. 
~~ -— 


Pp 
Rings, fields, etc. may also be ordered, whereby the usual monotonicity 


laws are required. For example, (Z,<,+,-,0,1) is the ordered ring of 
integers and (N,<,+,-,0,1) the ordered semiring of natural numbers. 


4. Semilattices and lattices. A = (A,°) is called a semilattice if o is 
associative, commutative, and idempotent. An example is ({0,1},°) with 
o= a. If we define a < b: acb=a then < is a reflexive partial order 
on A. Reflexivity holds, since aca = a. As can be easily verified, aob 
is in fact the infimum of a,b with respect to <, acb = inf{a,b}, that is, 
achb<a,b, andc<a,b>c<aoeb, for all a,b,cE A. 

A = (A,n, vu) is called a lattice if (A, 1) and (A, U) are both semi- 
lattices and the following so-called absorption laws hold: an(aub) = a 
and au(anb) = a. These imply anb = a © aub = b. As above, 
a <b: anb =a defines a partial order such that anb = inf{a,b}. In 
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addition, one has aub = sup{a,b} (the supremum of a,b), which is to 
mean a,b < aub, anda,jb<c => aub<ce, for alla,bc € A. If A 
satisfies, moreover, the distributive laws an(buc) = (anb)u(anc) and 
au(bnc) = (aub)n(auc), then A is termed a distributive lattice. For 
instance, the power set 8M with the operations N and U for n and u re- 
spectively is a distributive lattice, as is every nonempty family of subsets 
of M closed under M and U, a so-called lattice of sets. Another important 
example is (N, gcd, lcm). Here gcd(a, b) and lem(a, b) denote the greatest 
common divisor and the least common multiple of a,b € N. 


5. Boolean algebras. An algebra A = (A, 1, U,7) where (A, n, vu) is 
a distributive lattice and in which at least the equations 
=-~—a=8, aland)=—2ub, on=@—tn-=b 

are valid is called a Boolean algebra. The paradigm structure is the two- 
element Boolean algebra 2 := ({0,1},~,v,—), with nm, vu interpreted as 
A,V, respectively. One defines the constants 0 and 1 by 0 := an-a for 
any a € A and 1:=—0. There are many ways to characterize Boolean 
algebras A, for instance, by saying that A satisfies all equations valid in 2. 
The signature can also be variously selected. For example, the signature 
A,V,— is well suited to deal algebraically with two-valued propositional 
logic. Terms of this signature are, up to the denotation of variables, 
precisely the Boolean formulas from 1.1, and a valid logical equivalence 
a = @ corresponds to the equation a = (, valid in 2. Further examples 
of Boolean algebras are the algebras of sets A = (A,M,U,—7). Here A 
consists of a nonempty system of subsets of a set I, closed under M, U, 
and — (complementation in J). These are the most general examples; a 
famous theorem, Stone’s representation theorem, says that each Boolean 
algebra is isomorphic to an algebra of sets. 


6. Logical L-matrices. These are structures A = (A, L4, D4), where 
L contains only operation symbols (the “logical” symbols) and D denotes 
a unary predicate, the set of distinguished values of A. Best known is the 
two-valued Boolean matrix B = (2,D®) with D® = {1}. The consequence 
relation F4 in the propositional language F of signature LD is defined as 
in the two-valued case: Let X C ¥ and y € ¥. Then X Fy y if wy € D4 
for every w: PV — A with wX C D4 (wX := {wa|a € X}). In words, 
if the values of all a € X are distinguished, then so too is the value of yp. 
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Homomorphisms and isomorphisms. The following notions are im- 
portant for both mathematical and logical investigations. Much of the 
material presented here will be needed in Chapter 5. In the following 
definition, n (>0) denotes as always the arity of f or r. 


Definition. Let A,B be L-structures and h: A — B (strictly speaking 
h: A — B) a mapping such that for all f,c,r € LD and @€ A”, 

(Bye Ayes Phe, Aa Ma She (ha = (iain) 
Then h is called a homomorphism. If the third condition in (H) is replaced 
by the stronger condition (S): (Abe A")(h@=hb & r4b) & rBh& 2 then h 
is said to be a strong homomorphism. For algebras, the word “strong” 


is clearly dispensable. An injective strong homomorphism h: A — B is 
called an embedding of A into B. If, in addition, h is bijective then h is 
called an isomorphism, and in case A = B, an automorphism. 


An embedding or isomorphism h: A — B satisfies r4@ = r® ha. Indeed, 
since hd=hb & a=, (S) yields r2h@ > (SbEA")(@=b & r4b) => rz. 
A, B are said to be isomorphic, in symbols A ~ B, if there is an isomor- 


phism from A to B. It is readily verified that ~ is reflexive, symmetric, and 
transitive, hence an equivalence relation on the class of all L-structures. 


Examples 1. (a) A valuation w considered in 1.1 can be regarded as 
a homomorphism of the propositional formula algebra F into the two- 
element Boolean algebra 2. Such a w:F — 2 is necessarily onto. 

(b) Let A = (A, *) be a word semigroup with the concatenation operation 
x and B the additive semigroup of natural numbers, considered as L- 
structures for L = {0} with o4 = * and 0% = +. Let lh(€) denote the 
length of a word or string € € A. Then € + lh(&) is a homomorphism 
since lh(€ * 7) = lh(€) + lh(7), for all €,7 € A. If A is generated from a 
single letter, lh is evidently bijective, hence an isomorphism. 

(c) The mapping a + (a,0) from R to C (= set of complex numbers, 
understood as ordered pairs of real numbers) is a good example of an 
embedding of the field R into the field C. Nonetheless, we are used to 
saying that R is a subfield of C, and that R is a subset of C. 


2 (4bEA”")(h@=hb & r4b) abbreviates ‘there is some 6 € A” with hd = hb and r4o’. 
If h: A — B is onto (and only this case will occur in our applications) then (S) is 
equivalent to the more suggestive condition r° = {ha@| ra}. 
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(d) Let A = (R,+,<) be the ordered additive group of real numbers 
and B = (R,,+,<) the multiplicative group of positive reals. Then for 
any 6 € R+\{1} there is precisely one isomorphism 7:A — 6 such that 
nl = b, namely 7:2 +> 6b”, the exponential function exp, to the base b. 
It is even possible to define exp, as this isomorphism, by first proving 
that—up to isomorphism—there is only one continuously ordered abelian 
group (first noticed in |'Ta2] though not explicitly put into words). 

(ec) The algebras A = ({0,1},+) and B = ({0,1},<) are only apparently 
different, but are in fact isomorphic, with the isomorphism 6 where 60 = 1, 
61 = 0. Thus, since A is a group, 6 is a group as well, which is not 
obvious at first glance. By adjoining the unary predicate D = {1}, A and 
B become (nonisomorphic) logical matrices. These actually define the two 
“dual” fragmentary two-valued logics for the connectives either... or..., 
and ... if and only if ..., which have many properties in common. 


Congruences. A congruence relation (or simply a congruence) in a struc- 
ture A of signature L is an equivalence relation ~ in A such that for all 
n> 0, all f € L of arity n, and all a, be A”, 
axb > fAd~ fro. 

Here @ © b means a; © 0; for i = 1,...,n. A trivial example is the 
identity in A. If h: A — B is a homomorphism then =, C A?, defined 
by a &, b & ha = hb, is a congruence in A, named the kernel of h. Let 
A’ be the set of equivalence classes a/¥ := {x € Ala x} fora e A, 
also called the congruence classes of ©, and set @/® := (a1/*%,...,@n/*) 
for @€ A". Define f4’(@/x) := (f4@)/% and let r4’@/x + (Jbx@)rAb. 
These definitions are sound, that is, independent of the choice of the n- 


tuple d of representatives. Then A’ becomes an L-structure A’, the factor 
structure of A modulo ~, denoted by A/. Interesting and useful, e.g. in 
5.7, is the following very general and easily provable 


Homomorphism theorem. Let A be an L-structure and © a congruence 
in A. Then k:a+> a/®& is a strong homomorphism from A onto A/x%, 
the canonical homomorphism. Conversely, if h: A— B is a strong homo- 
morphism from A onto an L-structure B with kernel = thenr: a/% > ha 
is an isomorphism from A/x to B, and h= ick. 

Proof. We omit here the superscripts for f and r just for the sake of 
legibility. Clearly, kfa@ = (fa)/~ = f(@/~) = fka (=f Reizcssphta) 
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and (4bEA") (ka = kb & rb) & (Abx@)rb & ra/x & r ka by definition. 
Hence k is what we claimed. The definition of 2 is sound, and 2 is bijective 
since ha = hb > a/ = b/x&. Furthermore, 2 is an isomorphism because 


if (G/~) =hfa= fha = fr(@/x) and ra/~ & rhé Ss rr(a/x). 
Finally, h is the composition 2°k by the definitions of 1 and k. (J 


Remark. For algebras A, this theorem is the usual homomorphism theorem of 
universal algebra. A/* is then named the factor algebra. The theorem covers 
groups, rings, etc. In groups, the kernel of a homomorphism is already deter- 
mined by the congruence class of the unit element, called a normal subgroup, 
in rings by the congruence class of 0, called an ideal. Hence, in textbooks on 
basic algebra the homomorphism theorem is separately formulated for groups 
and rings, but is easily derivable from the general theorem present here. 


Direct products. These provide the basis for many constructions of new 
structures, especially in 5.7. A well-known example is the n-dimensional 
vector group (R”,0,+). This is the n-fold direct product of the group 
(R,0,+) with itself. The addition in R” is defined componentwise, as is 
also the case in the following 


Definition. Let (A;)jez be a nonempty family of L-structures. The 
direct product B = |],-; A; is the structure defined as follows: Its domain 
is B= [hex A;, called the direct product of the sets A;. The elements 
a = (a;)ier of B are functions defined on I with a; € A; for each i € I. 
Relations and operations in B are defined componentwise, that is, 
rba o rAd; forallicl, fPa=(fAdi)ier, c8 = (c*)ier, 

where @ = (a',...,a”) € B” (here the superscripts count the components) 
with a’ := (a”);er for vy = 1,....,n, and @& := (a},...,a7) € AP. 

Whenever A; = A for all i € J, then [J;<; A; is denoted by A! and 
called a direct power of the structure A. Note that A is embedded in A! 
by the mapping a +> (a)je7, where (a)jez denotes the J-tuple with the 
constant value a, that is, (a)iey = (a,a,...). For J = {1,...,m}, the 
product [],-, A; is also written as A; x --- x Am. If I = {0,...,n—1} 
one mostly writes A” for A‘. 

Examples 2. (a) Let I = {1,2}, A; = (Ai,<‘), and B = []je; Ai. 
Thena <P bSa, <! by & ag <? by, for all a,b € B= A; x Ao. Note 
that if A,,A>2 are ordered sets then B is only a partial order. The deeper 
reason for this observation will become clear in Chapter 5. 
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(b) Let B = 2! be a direct power of the two-element Boolean algebra 2. 
The elements a € B are I-tuples of 0 and 1. These uniquely correspond 
to the subsets of J via the mapping z:a +> Ig := {ie I|a; = 1}. Asa 
matter of fact, 2 is an isomorphism from B to (®J,N,U,-), as can readily 
be verified; Exercise 4. 


Exercises 


1. Show that there are (up to isomorphism) exactly five two-element 
proper groupoids. Here a groupoid (H,-) is termed proper if the 
operation + is essentially binary. 


2. x (C A?) is termed Euclidean ifaxb&axc => bce, for all 
a,b,c € A. Show that * is an equivalence relation in A if and only 
if © is reflexive and Euclidean. 


3. Prove that an equivalence relation ~ on an algebraic L-structure A 


is a congruence iff for all f € LD of arity n, alli = 1,...,n, and all 
! . ~ tf 
Q1,---,4j-1,4,@',Qj41,---,@n € Awithara’, 
ae, ! 
F(@i,.. +, Qi-1, @, Qi41,---,@n) © f(@i,-.-,@i-1,@, @i41,+++, Qn). 


4. Prove in detail that 2/ ~ ($J,N,U,—) for a nonempty index set I. 
Prove the corresponding statement for any subalgebra of 2/. 


5. Show that h: [[,<; Ai — Aj with ha = a; is a homomorphism for 
each j € I. 


2.2 Syntax of First-Order Languages 


Standard mathematical language enables us to talk precisely about struc- 
tures, such as the field of real numbers. However, for logical (and meta- 
mathematical) issues it is important to delimit the theoretical framework 
to be considered; this is achieved most simply by means of a formalization. 
In this way one obtains an object language; that is, the formalized elements 
of the language, such as the components of a structure, are objects of our 
consideration. To formalize interesting properties of a structure in this 
language, one requires at least variables for the elements of its domain, 
called individual variables. Further are required sufficiently many logical 
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symbols, along with symbols for the distinguished relations, functions, 
and constants of the structure. These extralogical symbols constitute the 
signature DL of the formal language £ that we are going to define. 


In this manner one arrives at the first-order languages, also termed 
elementary languages. Nothing is lost in terms of generality if the set 
of variables is the same for all elementary languages; we denote this set 
by Var and take it to consist of the countably many symbols vo, v1,... 
Two such languages therefore differ only in the choice of their extralogical 
symbols. Variables for subsets of the domain are consciously excluded, 
since languages containing variables both for individuals and sets of these 
individuals—second-order languages, discussed in 3.8—have different se- 
mantic properties from those investigated here. 


We first determine the alphabet, the set of basic symbols of a first-order 
language £ defined by a signature L. It includes, of course, the already 
specified variables vo, v1,... In what follows, these will mostly be denoted 
by x,y, Z, u,v, though sometimes other letters with or without indices may 
serve the same purpose. The boldface printed original variables are useful 
in writing down a formula in the variables v;,,...,vi,,, for these can then 
be denoted, for instance, by v1,...,Un, or by %1,...,2n. 

Further, the logical symbols ~ (and), — (not), V (for all), the equality 
sign =, and, of course, all extralogical symbols from L should belong to 
the alphabet. Note that the boldface symbol = is taken as a basic symbol; 
simply taking the equality symbol = could lead to unintended mix-ups 
with the ordinary use of = (in Chapter 4 also identity-free languages 
without = will be considered). Finally, the parentheses (, ) are included 
in the alphabet. Other symbols are introduced by definition, e.g., v, +, 


are defined as in 1.4 and the symbols 4 (there exists) and 4! (there exists 
exactly one) will be defined later. Let S¢ denote the set of all strings 
made up of symbols that belong to the alphabet of L. 


From the set 8,¢ of all strings we pick out the meaningful ones, namely 
terms and formulas, according to certain rules. A term, under an inter- 
pretation of the language, will always denote an element of a domain, 
provided an assignment of the occurring variables to elements of that do- 
main has been given. In order to keep the syntax as simple as possible, 
terms will be understood as certain parenthesis-free strings, although this 
kind of writing may look rather unusual at the first glance. 
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Terms in L: 


(T1) Variables and constants, considered as atomic strings, are terms, 
also called prime terms. 


(T2) If f € Lis n-ary and t),...,t, are terms, then ft,---t, is a term. 


This is a recursive definition of the set of terms as a subset of 5¢. Any 
string that is not generated by (T1) and (T2) is not a term in this context 
(cf. the related definition of F in 1.1). Parenthesis-free term notation 
simplifies the syntax, but for binary operations we proceed differently in 
practice and write, for example, the term -+ryz as (x+y)+z. The reason 
is that a high density of information in the notation complicates read- 
ing. Our brain does not process information sequentially like a computer. 
Officially, terms are parenthesis-free, and the parenthesized notation is 
just an alternative way of rewriting terms. Similarly to the unique recon- 
struction property of propositional formulas in 1.1, here the unique term 
reconstruction property holds, that is, 

ftr-:++tn = fs1-++ Sn implies 8; = t; fori=1,...,n (tj, 5; terms), 
which immediately follows from the unique term concatenation property 

ty: -++ty = 81°°+ Sm impliesn =m and t; = s; fori =1,...,n. 
The latter is shown in Exercise 2. T (= J,) denotes the set of all terms 
of a given signature L. The set J can be understood as an algebra with 
the operations given by f7 (t1,...,tn) = ftr-++tn, called the term algebra. 
Variable-free terms, which exist only with the availability of constant sym- 
bols, are called constant terms or ground terms, mainly in logic program- 
ming. From the definition of terms immediately follows the useful 


Principle of proof by term induction. Let E be a property of strings 
such that E holds for all prime terms, and for each n > 0 and each n-ary 
function symbol f, the assumptions Eti,...,Etn imply Efty---tn. Then 
all terms have the property E. 


Indeed, T is by definition the smallest set of strings satisfying the condi- 
tions of this principle, and hence a subset of the set of all strings with the 
property €. A simple application of term induction is the proof that each 
compound term t is a function term in the sense that t = ft,---t, for 
some n-ary function symbol f and some terms t1,..., tn. Simply consider 
the property ‘t is either prime or a function term’. Term induction can 
also be executed on certain subsets of J, for instance on ground terms. 
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We also have at our disposal a definition principle by term recursion 
which, rather than defining it generally, we present through examples. 
The set vart of variables occurring in a term t is recursively defined by 


varc=( : vara = {x} > var fty---ty, = varty U---U vartn. 


vart, and even var€ for any € € 8 , can also be defined explicitly using 
concatenation. var€ is the set of all « € Var for which there are strings 
y, 0 with € = nxv. The notion of a subterm of a term can also be defined 
recursively. Again, we can also do it more briefly using concatenation. 
Definition by term induction should more precisely be called definition by 
term recursion. But most authors are sloppy in this respect. 

We now define recursively those strings from 8,¢ to be called formulas, 
also termed expressions or well-formed formulas of signature L. 


Formulas in L: 
(F1) If s,¢ are terms, then the string s=¢ is a formula. 


(F2) Ifti,...,t, are terms and r € L is n-ary, then rt; --- ty is a formula. 


(F3) If a,@ are formulas and z is a variable, then (a3), 7a, and Vra 
are formulas. 


Any string not generated according to (F1), (F2), (F3) is in this context 
not a formula. Other logical symbols serve throughout merely as abbre- 


viations, namely dra := =Var-7a, (a v B) := a(-aa-78), and as in 1.1, 
(a > 8B) := 7A(an7B), and (a © 8) := ((a +8)A(G —-a)). In addition, 
st will throughout be written for >s=t. The formulas Vra and dra 


are said to arise from a@ by quantification. 


Examples. (a) Vredyxz + y=0 (more explicitly, VeaVyn xz + y=0) isa 
formula, expressing ‘for all x there exists a y such that x+y = 0’. Here we 
assume tacitly that x,y denote distinct variables. The same is assumed 
in all of the following whenever this can be made out from the context. 
(b) VaVax=y is a formula, since repeated quantification of the same 
variable is not forbidden. Vza=y is a formula also if z 4 x,y, although 
z does then not appear in the formula «= y. 


Example (b) indicates that the grammar of our formal language is more 
liberal than one might expect. This will spare us a lot of writing. The for- 


mulas VaVa2 x= y and Jx¢Va x= y both have the same meaning as Vz x= y. 
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These three formulas are logically equivalent (in a sense still to be defined), 
as are Vzx=y and x=y for z distinct from x and y. It would be to our 
disadvantage to require any restriction here. In spite of this liberality, the 
formula syntax corresponds roughly to the syntax of natural language. 

The formulas procured by (F1) and (F2) are said to be prime or atomic 
formulas, or simply called prime. As in propositional logic, prime formulas 
and their negations are called literals. 

Prime formulas of the form s=¢t are called equations. These are the 
only prime formulas if Z contains no relation symbols, in which case L 
is called an algebraic signature. Prime formulas that are not equations 
begin with a relation symbol, although in practice a binary symbol tends 
to separate the two arguments as, for example, in « < y. The official 
notation is, however, that of clause (F2). The unique term concatenation 
property clearly implies the unique prime formula reconstruction property 


rty-+-ty =1751-++ Spy implies t; = s; fori =1,...,n. 


The set of all formulas in L is denoted by £. The case L = @) defines 
the language of pure identity, denoted by L=, whose prime formulas are 
all of the form x= y. If L = {¢} or L = {} then CL is also denoted by L- 
or £,, resp. If Z is more complex, e.g. L = {o,e}, we write L = L{o,e}. 

Instead of terms, formulas, and structures of signature L, we will talk 
of £-terms (writing Tc for J), £-formulas, and £-structures respectively. 
We also omit the prefix if £ has been given earlier and use the same 
conventions of parenthesis economy as in 1.1. We will also allow ourselves 
other informal aids in order to increase readability. For instance, variously 


shaped brackets may be used as in VrdyVz[zey — Ju(zeuaue az). Even 
verbal descriptions (partial or complete) are permitted, as long as the 
intended formula is uniquely recognizable. 


The strings Vx and 4x (read “for all x” respectively “there is an x”) are 


called prefixes. Also concatenations of these such as Vady are prefixes. No 


other prefixes are considered here. Formulas in which V,4 do not occur 
are termed quantifier-free or open. These are the Boolean combinations 
of prime formulas. Generally, the Boolean combinations of formulas from 
a set X C £ are the ones generated by 7, ~ (and v) from those of X. 
X,Y, Z always denote sets of formulas, a, 3, 7,6,7,y~,... denote formu- 
las, and s,t terms, while ©, WV are reserved to denote finite sequences of 
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formulas and formal proofs. Substitutions (to be defined below) will be 
denoted by 0,7,w,p, and ¢. 

Principles of proof by formula induction and of definition by formula 
induction (more precisely formula recursion) also exist for first-order and 
other formal languages. After the explanation of these principles for 
propositional languages in 1.1, it suffices to present here some examples, 
adhering to the maxim verba docent, exempla trahunt. Formula recursion 
is based on the unique formula reconstruction property, which is similar to 
the corresponding in 1.1: Each composed y € £ can uniquely be written 
as p = 7a, y = (anf), or Vra for some a, 3 € Land «x € Var. A simple 
example of a recursive definition is rk y, the rank of a formula y. Starting 
with rk a = 0 for prime formulas 7 it is defined as on page 8, with the 
additional clause rk Vea = rka+1. Functions on £ are sometimes defined 
by recursion on rk y, not on y, as for instance on page 60. 

Useful for some purposes is also the quantifier rank, qr yp. It represents 
a measure of nested quantifiers in y. For prime 7 let qra = 0, and let 
qr-a=qra, qr(arZ) = max{qra,qr 3}, qrVza = qra+1. 


Note that qrday = qr Vay = qrVay. A subformula of a formula is 
defined analogously to the definition in 1.1. Hence, we need say no more 
on this. We write « € bndy (or x occurs bound in w) if y contains the 
prefix Vx. In subformulas of vy of the form Vxa, the formula a is called 
the scope of Vz. The same prefix can occur repeatedly and with nested 
scopes in y, as for instance in Va(Vxr=0 a x<y). In practice we avoid 
this way of writing, though for a computer this would pose no problem. 


Intuitively, the formulas (a) Vaityx + y=0 and (b) dyx + y=0 are 
different in that in every context with a given meaning for + and 0, the 
former is either true or false, whereas in (b) the variable x is waiting to be 
assigned a value. One also says that all variables in (a) are bound, while 
(b) contains the “free” variable x. The syntactic predicate ‘x occurs free 
in y’, or ‘a € freey’ is defined inductively: Let freea = vara for prime 
formulas a (vara was defined on page 56), and 


free(anZ) = freea U free, freema = freea, freeVra = freea\{z}. 


For instance, free (Vady z+ y=0) = 0, while free (x < y a Vasyx+y=0) 
equals {x,y}. As the last formula shows, x can occur both free and bound 
in a formula. This too will be avoided in practice whenever possible. In 
some proof-theoretically oriented presentations, even different symbols are 


2.2 Syntax of First-Order Languages 59 


chosen for free and bound variables. Each of these approaches has its 
advantages and its disadvantages. 

Formulas without free variables are called sentences, or closed formulas. 
1+1=0 and Vadyr+y=0 (= Va-Vy72+ y= 0) are examples. Through- 


out take £° to denote the set of all sentences of £. More generally, let L* 
be the set of all formulas y such that freey C Var, := {vo0,...,Uz_—1}- 
Clearly, £° °C £* C++» and £L=| jag t*. 

At this point we meet a for the remainder of the book valid 
Convention. As long as not otherwise stated, the notation y = (x) 
means that the formula y contains at most x as a free variable; more 
generally, p = (%1,...,£n) or py = y(Z) is to mean freey C {21,..., Zn}, 
where £1,..., 2, stand for arbitrary but distinct variables. Not all of these 
variables need actually occur in y. Further, t = t(#) for terms t is to be 
read completely analogously. 

The term ft,--+tn is often denoted by ft, the prime formula rt, ---ty 
by rt. Here t denotes the string concatenation t,---t,. Fortunately, t 
behaves exactly like the sequence (t1,...,tn) as was pointed out already; 
it has the unique term concatenation property, see page 55. 


Substitutions. We begin with the substitution £ of some term t for a 


single variable x, called a simple substitution. Put intuitively, g £ (also 
denoted by yz(t) and read “y t for x”) is the formula that results from 
replacing all free occurrences of x in wy by the term ¢t. This intuitive 
characterization is made precise recursively, first for terms by 

te=t, ys =y (@#Y), Cr=e, (ft tr)s = ft th, 
where, for brevity, t/ stands for t;£, and next for formulas as follows: 


(t:=t)£ =4=46, (rt)h =rt,---d, ayant -{ ifx=y, 
t 


/ 
2 
(ar B)£ =ab rnBt, (-a)£ =-(a8), Vy(a) otherwise. 
Then also (a>+8)£ =a — @£, and the corresponding holds for v, 
while (Jya)£ = Sya for y = x, and Jy(a£) otherwise. Simple substitu- 


tions are special cases of so-called simultaneous substitutions 


iyort re 
Y ASE (21;.+.52_ distinct), 


For brevity, this will be written gt or yz(t) or just y(t), provided there is 
no danger of misunderstanding. Here the variables x; are simultaneously 
replaced by the terms ¢; at free occurrences. Simultaneous substitutions 
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easily generalize to global substitutions o. Such a o assigns to every 
variable x a term «2° € JT. It extends to the whole of T by the clauses 
c’ = cand (ft)° = ft{---t2%, and subsequently to £L by recursion on 
rk yp, so that o is defined for the whole of TUL: (t)=t2)7 = t{=19, 
(rt)? = rt] ---t2, (aa B)? = a7 AB, (a)? = 7a”, and (Vzy)? = Vary", 
where 7 is defined by x7 = x and y™ = y° for y 4 x.3 

These clauses cover also the case of a simultaneous substitution, because 
3 can be identified with the global substitution o such that x7 = t; 
fori = 1,...,n and x° = x otherwise. In other words, a simultaneous 
substitution can be understood as a global substitution o such that 7° = x 
for almost all variables x, i.e., with the exception of finitely many. The 
identical substitution, always denoted by 1, is defined by x’ = & for all a; 
hence t’ = ¢ and y’ = y for all terms ¢ and formulas y. 


Clearly, a global substitution yields locally, i.e. with respect to individual 
formulas, the same as a suitable simultaneous substitution. Moreover, it 
will turn out below that simultaneous substitutions are products of simple 
ones. Nonetheless, a separate study of simultaneous substitutions is useful 
mainly for Chapter 4. 


It always holds that 22 = #4. whereas the compositions # 2 and 
BR a are distinct, in general. Let us elaborate by explaining the difference 


between yp #2 and pE## (= (~H)#). For example, if one wants 


to swap 21,22 at their free occurrences in y then the desired formula 


is p #3, but not, in general, y = = (choose for instance y = 41<22). 


Rather »p 23 = 9 4 BF for any y ¢ var pU{z1, £9}, as is readily shown 


by induction on y after first treating terms. We recommend to carry out 


this induction in detail. In the same way we obtain 


(1) = pH fe aur ue (y ¢ vary U var%U vart, n> 2) 


This formula shows that a simultaneous substitution is a suitable product 
(composition) of simple substitutions. Conversely, it can be shown that 
each such product can be written as a single simultaneous substitution. 
In some cases (1) can be simplified. Useful, for example, is the following 
equation which holds in particular when all terms t; are variable-free: 


2) vh=ph-- (wig varty fori #3). 


3 Since rk y < rk Vxy, we may assume according to the recursive construction of o that 
a 


y" is already defined for all global substitutions rT. 
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Getting on correctly with substitutions is not altogether simple; it 
requires practice, because our ability to regard complex strings is not 
especially trustworthy. A computer is not only much faster but also more 
reliable in this respect. 


Exercises 


1. Show by term induction that a terminal segment of a term ¢ is a 
concatenation s1--- 5, of terms s; for some m > 1. Thus, a symbol 
in ¢ is at each position in ¢ the initial symbol of a unique subterm s 
of t. The uniqueness of s is an easy consequence of Exercise 2(a). 


2. Let £ be a first-order language, JT = Tz, and Et the property ‘No 
proper initial segment of t (€ J) is a term, nor is ¢ a proper initial 
segment of a term from TJ’. Prove (a) Et for all t € T, hence 
té= é’ > t =7 for all t,t’ € T and arbitrary €,€’ € 8 ¢, and 
(b) the unique term concatenation property (page 55). 


3. Prove (a) No proper initial segment of a formula y is a formula. 
(b) The unique formula reconstruction property stated on page 58. 
(c) pF €L>EELanda,(ank)ELSEECL. (c) easily yields 
(d) a,(a>£) €LS>E€EL, for all € € $e. 

4. Prove that y£ = for x ¢ freey, and p¥ i =F for y ¢ vary. 
It can even be shown y£ =y 2a ¢ freey whenever t # 2. 


5. Let X C £L be a nonempty formula set and X* = XU{7y|y eE X}. 
Show that a Boolean combination of formulas from X is equivalent 
to a disjunction of conjunctions of formulas from X*. 


2.3. Semantics of First-Order Languages 


Intuitively it is clear that the formula Jy y+y=<z can be allocated a truth 
value in the domain (N, +) only if to the free variable x there corresponds a 
value in N. Thus, along with an interpretation of the extralogical symbols, 
a truth value allocation for a formula y requires a valuation of at least the 
variables occurring free in y. However, it is technically more convenient 
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to work with a global assignment of values to all variables, even if in a 
concrete case only the values of finitely many variables are needed. We 
therefore begin with the following 


Definition. A model M is a pair (A,w) consisting of an £-structure A 
and a valuation w: Var > A, w:a +> 2”. We denote r4, f4,c4, and 2” 
also by r™, f@, ce, and a™, respectively. The domain of A will also be 


called the domain of M. 


Models are sometimes called interpretations, occasionally also £-models 
if the connection to L is to be highlighted. Some authors identify models 
with structures from the outset. This also happens in 2.5, where we 
are talking about models of theories. The notion of a model is to be 
maintained sufficiently flexible in logic and mathematics. 


A model M allocates in a natural way to every term t a value in A, 
denoted by t™“ or t4” or just by ¢”. Clearly, for prime terms the value is 
already given by M. This evaluation extends to compound terms by term 
induction as follows: (ft)“ = fM@t™, where t™ abbreviates here the 
cece 


.., t/“). If the context allows we neglect the superscripts 


sequence ( 
and retain just an imaginary distinction between symbols and their inter- 
pretation. For instance, if A = (N,+,-,0,1) and x” = 2, say, we write 
somewhat sloppily (0-a2+1)4"% =0-24+1=1. 

The value of t under M depends only on the meaning of the symbols 
that effectively occur in t; using induction on t, the following slightly more 
general claim is obtained: if vart C V C Var and M,™M’ are models with 
the same domain such that a = 2" for all zc € V and s“ = s" for 
all remaining symbols s occurring in t, then t“@ = t™’. Clearly, 4” may 
simply be denoted by t4, provided the term t contains no variables. 

We now are going to define a satisfiability relation F between models 
M = (A,w) and formulas y, using induction on y as in 1.3. We read 
MEwgpasM satisfies y, or M is a model for y. 

Sometimes AF y[w] is written instead of MF y. A similar notation, 
just as frequently encountered, is introduced later. Each of these notations 
has its advantages, depending on the context. If MF yg for all yp € X 
we write M — X and call M a model for X. For the formulation of 
the satisfaction clauses below (taken from |'Tal]) we consider for given 
M = (A,w), x € Var, and a € A also the model M% (generalized to M4 
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below). M& differs from M only in that the variable x receives the value 
a € A instead of «”. Thus, M% = (A,w’) with 2” =a and y” = y” 
otherwise. The satisfaction clauses then look as follows: 

MEs=t & sVvMa=i“, 

ME rt ee MEM, 

ME (arB) & MeEaand ME 8B, 

MF 7a = MFa, 

MEVta & MF aforallae A. 


Remark 1. The last satisfaction clause can be stated differently if a name for 
each a € A, say a, is available in the signature: M F Vra © MF a ¢ for all 
a € A. This assumption permits the definition of the satisfaction relation for 
sentences using induction on sentences while bypassing arbitrary formulas. If 
not every a € A has a name in JL, one could “fill up” LZ in advance by adjoining 
to L aname a for each a. But expanding the language is not always wanted and 
does not really simplify the matter. 


M3 is slightly generalized to M@ := M4t7a (= (M21)@ ...), which 


Lp Ly 


differs from M in the values of a sequence 21,..., 2p of distinct variables. 
This and writing Vry for Vz; ---Vxpy permits a short notation of a useful 
generalization of the last clause above, namely 


ME Ve = MF » for all GE A”. 


The definitions of a v 6, a +6, and a + @ from page 56 readily imply 
the additional clauses MF av 6 iff MEaorME 6B, MEa-p 
if MEa=> ME B, and analogously for «+. Clearly, if v, +,< were 
treated as independent connectives, these equivalences would have to be 


added to the above ones. Further, the definition of dry in 2.2 corresponds 


to its intended meaning, because M F dap = M%F y for some a € A. 
Indeed, whenever M F =Var-7y (= Ary) then M2 F -7y does not hold 
for all a; hence there is some a € A such that M&% F -y, or equivalently, 


M:° — yo. And this chain of reasoning is obviously reversible. 


Example 1. M - dx x2=t for arbitrary M, provided x ¢ vart. Indeed, 
M2 - x=twitha:=™, sincera“: =a =t™ = t® in view of « ¢ vart. 


The assumption x ¢ vart is essential. For instance, M F da a= fx holds 


only if the function f™ has a fixed point. 


We now introduce several fundamental notions that will be treated more 
systematically in 2.4 and 2.5, once certain necessary preparations have 
been completed. 
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Definition. A formula or set of formulas in £ is termed satisfiable if it 
has a model. y € £ is called generally valid, logically valid, or a tautology, 
in short, F y, if M F— @ for every model M. Formulas a, are called 
(logically or semantically) equivalent, in symbols, a = (, if 
MEa & ME Q, for each £-model M. 

Further, let AF » (read y holds in A or A satisfies ~) if (A,w) F » for 
all w: Var > A. One writes AF X in case AF y for all py € X. Finally, 
let X F y (read from X follows y, or p is a consequence of X) if every 
model M of X also satisfies the formula y, i.e, MEX => ME ». 


As in Chapter 1, F denotes both the satisfaction and the consequence 
relation. Here, as there, we write ¥1,..-,%n F y for {y1,..-, Qn} F y. 
Note that in addition, F denotes the validity relation in structures, which 
is illustrated by the following 


Example 2. We show that AF Vrityx # y, where the domain of A 
contains at least two elements. Indeed, let M = (A,w) and let a € A 
be given arbitrarily. Then there exists some 6 € A with a  b. Hence, 
(M3)? = Mer F ay, and so M2 F dyx#y. Since a was arbitrary, 
M FF Vadyx #y. Clearly the actual values of w are irrelevant in this 
argument. Hence (A, w) F Vatyx#y for all w, that is, AF Vasyr¥y. 


Here some care is needed. While M F y or M F - 9 for all formulas, 


AF yor AF 79 (the law of the excluded middle for validity in structures) 
is in general correct only for sentences y, as Theorem 3.1 will show. If 
A contains more than one element, then, for example, neither AF r= y 
nor AF xy. Indeed, r= y is falsified by any w such that 7” 4 y”, and 
xy by any w with «” = y”. This is one of the reasons why models were 
not simply identified with structures. 

For y € L let vy! be the sentence Vr, ---V%my, where 21,...,2m is 
an enumeration of freey according to index size, say. y% is called the 
generalized of , also called its universal closure. For py € L° clearly 
yp! =. From the definitions immediately results 

(1) AFy & AF y’%, 
and more generally, AF X @ AF X49 (:= {py? | yp € X}). (1) explains 
why y and 4 are often notionally identified, and the information that 


formally runs y% is often shortened to y. It must always be clear from 


2.3 Semantics of First-Order Languages 65 


the context whether our eye is on validity in a structure, or on validity in 
a model with its fixed valuation. Only in the first case can a generaliza- 
tion (or globalization) of the free variables be thought of as carried out. 


However, independent of this discussion, F y = F y% always holds. 
Even after just these incomplete considerations it is already clear that 

numerous properties of structures and whole systems of axioms can ad- 

equately be described by first-order formulas and sentences. Thus, for 


1 


example, an axiom system for groups in °,e,~*, mentioned already in 


2.1, can be formulated as follows: 

VaVyVz wo(yoz)=(xoy)oz; Varce=a2; Varor =e. 
Precisely, the sentences that follow from these axioms form the elementary 
group theory in o,e,—+. It will be denoted by TG . In the sense elaborated 
in Exercise 3 in 2.6 an equivalent formulation of the theory of groups in 


o,e, denoted by Tg, is obtained if the third Tg -axiom is replaced by 


Vatyxoy=e. Let us mention that Vraeox=a and Vadtyyox=e are 
provable in Tg and also in T=. 

An axiom system for ordered sets can also easily be provided, in that 
one formalizes the properties of being irreflexive, transitive, and connex. 
Here and elsewhere, Vx1---2,y stands for Vr, --- Van: 


Van fx; Vayz(a4<yay<z-2 < z); Vay(a#y or <yvy <2). 


In writing down these and other axioms the outer V-prefixes are very 
often omitted so as to save on writing, and we think implicitly of the 
generalization of variables as having been carried out. This kind of eco- 
nomical writing is employed also in the formulation of (1) above, which 
strictly speaking runs ‘for all A,p: AF y @& AF y?’. 

For sentences a of a given language it is intuitively clear that the values 
of the variables of w for the relation (A,w) F a are irrelevant. The 
precise proof is extracted from the following theorem for V = @. Thus, 
either (A,w) F a for all w and hence AF a, or else (A,w) F a for no w, 
ie., (A, w) F 7a for all w, and hence AF —a. Sentences therefore obey 
the already-cited tertium non datur. 


Theorem 3.1 (Coincidence theorem). Let V C Var, freey C V, and 
M,M!' be models on the same domain A such that e = 2’ for all 
xr e€V, ands“ =s™ for all extralogical symbols s occurring in yp. Then 


MEgo & M Ey. 
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Proof by induction on y. Let y = rt be prime, so that vart C V. As was 
mentioned earlier, the value of a term ¢t depends only on the meaning of the 
symbols occurring in ¢t. But in view of the suppositions, these meanings 
are the same in M and M’. Therefore, £“ = ¢ (i.e., a = ia for 
i=1,...,n), andsoMErt 6 MEM o MEM & ME rt. For 
equations tj = tz one reasons analogously. Further, the induction hypoth- 
esis for a, yields MF anBeMEaBeM Fabs M Fan. 
In the same way one obtains M - ~a & M’'F 7a. By the induction step 


on V it becomes clear that the induction hypothesis needs to be skillfully 
formulated. It must be given with respect to any pair M,M’ of models 
and any subset V of Var. 

Therefore let a € A and M2 FE yo. Since for V’ := V U {x} certainly 
free C V’ and the models M%, M'% coincide for all y € V’ (although 
in general «“ 4 aM"), by the induction hypothesis M% F yp = M'SE o, 
for each a € A. This clearly implies 

MEVzy eS MfF gg foralas MSF oy foralasM'EVay. LJ 


It follows from this theorem that an £-model M = (A,w) of y for 
the case that y € LC L’ can be completely arbitrarily expanded to an 
L'-model M’ = (A’,w) of y, ie., arbitrarily fixing s for s € L/\L 
gives MF yp & M’E po by the above theorem with V = Var. This 
readily implies that the consequence relation Fg, with respect to L’ is a 


conservative extension of Fe in that X Fe yp & X Fe y, for all sets 
X C £and all y € £. Hence, there is no need here for using indices. In 
particular, the satisfiability or general validity of y depends only on the 
symbols effectively occurring in ¢. 

Another application of Theorem 3.1 is the following fact, which justifies 
the already mentioned “omission of superfluous quantifiers.” 


(2) Vay = y = Ary whenever z ¢ free vy. 


Indeed, x ¢ freey implies MF yp @& M&F o (here a € A is arbitrary) 
according to Theorem 3.1; choose M’ = M‘ and V = freey. Therefore, 
MEVazp e& MELE ¢ foralas MEy 
= MF » for somea & MF Azy. 
Very important for the next theorem and elsewhere is 
(3) If ACB, M=(A,w), M' = (B,w) and w: Var — A then 
tM = tM", 
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This is clear for prime terms, and the induction hypothesis i = oe for 
i=1,...,n together with f@ = ‘iad imply 


(FEM = FMM. tet) = PH, tt) = (FEY. 
For M = (A,w) and x” = a; let t47, or more suggestively t4(@) denote 
the value of t = t(#). Then (3) can somewhat more simply be written as 

(4) ACB andt =t(2) imply t4(@) = t8(@) for all de A”. 

Thus, along with the basic functions, also the so-called term functions 
Gtr tA(a) are the restrictions to their counterparts in B. Clearly, ifn = 0 
or t is variable-free, one may write t4 for t4(@). Note that in these cases 
tA = t® whenever A C B, according to (4). 

By Theorem 3.1 the satisfaction of y in (A,w) depends only on the 
values of the x € freey. Let y = y(#)* and @ = (a1,...,an) € A”. Then 
the statement 

(A, w) F @ for a valuation w with xj” = a1,..., 2,” = an 
can more suggestively be expressed by writing 
(A,@) Fy or AF ylay,...,a,] or AF vy [al 
without mentioning w as a global valuation. Such notation also makes 
sense if w is restricted to a valuation on {21,..., 2}. One may accordingly 
extend the concept of a model and call a pair (A, @) a model for a formula 


y(Z) whenever (A, a) F y(Z), in particular if py € £”. We return to this 
extended concept in 4.1. Until then we use it only for n = 0. That is, 
besides M = (A,w) also the structure A itself is occasionally called a 
model for a set S C L° of sentences, provided AF S. 

As above let y = y(Z). Then v4 := {@ € A” | AF y[a]} is called the 
predicate defined by the formula y in the structure A. For instance, the 


<-predicate in (N,+) is defined by y(z,y) = dzz+x=y, but also by 
several other formulas. 

More generally, a predicate P C A” is termed (explicitly or elementarily 
or first-order) definable in A if there is some y = y(Z) with P = y4, and 
y is called a defining formula for P. Analogously, f: A” — A is called 
definable in A if y4 = graph f for some y(#,y). One often talks in this 


4 Since this equation is to mean freey C {x1,...,2%n}, # is not uniquely determined 
by y. Hence, the phrase “Let y = y(Z)...” implicitly includes along with a given ~ 
also a tuple Z given in advance. The notation y = y(#) does not even state that y 
contains free variables at all. 
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case of explicit definability of f in A, to distinguish it from other kinds 
of definability. Much information is gained from the knowledge of which 
sets, predicates, or functions are definable in a structure. For instance, 
the sets definable in (N,0,1,+) are the eventually periodic ones (periodic 
from some number on). Thus, - cannot explicitly be defined by +,0,1 
because the set of square numbers is not eventually periodic. 

A C Band y = ¢(Z) do not imply y4 = vy? nA”, in general. For 
instance, let A = (N,+), B = (Z,+), and y = 4zz24+ "4=y. Then 
yA _ <A 
will show, y4 = y8 nA” holds in general only for open formulas y, and 
is even characteristic for A C B provided A C B. Clearly, A C B is much 
weaker a condition than A C B: 


, while y® contains all pairs (a,b) € Z?. As the next theorem 


Theorem 3.2 (Substructure theorem). For structures A,B such that 
ACB the following conditions are equivalent: 


(i) ACB, 
(ii) AE g[a] = BE od], for all open py = y(Z) and all G € A”, 
(iii) AE y[a] 6 BE vlad], for all prime formulas y(z) and a € A”. 


Proof. (i)= (ii): It suffices to prove that MF yp & M’ — 9, with 
M = (A,w) and M’ = (B,w), where w: Var — A. In view of (3) the 
claim is obvious for prime formulas, and the induction steps for ~,—7 are 
carried out just as in Theorem 3.1. (ii)=-(iii): Trivial. (iii)=>(i): By (iii), 
rAd & AE rz [a] 6 BE rz [a] 6 r8a. Analogously, 

fAd=b & AF f#=yla,b] o BE f#=y/a,b] = fFa=b, 


for all a € A", b€ A. These conclusions state precisely that ACB. (J 


Let a be of the form Vz with open @, where Vz may also be the empty 
prefix. Then a is a universal or V-formula (spoken “A-formula”), and for 
a € £L° also a universal or V-sentence. A simple example is VrVy x= y, 


which holds in A iff A contains precisely one element. Dually, 473 with 


G open is termed an 3-formula, and an J-sentence whenever 37%G € L°. 
Examples are the “how-many sentences” 
1 (= 3V0 Vo = VO; n (= 1V0°°': tUy 1 Nicjen ViFY; (> 1). 


4, states ‘there exist at least n elements’, =4,,41 thus that ‘there exist 


at most n elements’, and dep, := JpA7dni1 says ‘there exist exactly 
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n elements’. Since 4, is a tautology, it is convenient to set T := qh, 


and do := 1 := —7 in all first-order languages with equality. Clearly, 
equivalent definitions of T, | may be used as well. 


Corollary 3.3. Let AC B. Then every V-sentence Via valid in B is also 
satisfied in A. Dually, every d-sentence A#G valid in A is also valid in B. 


Proof. Let B — VZB and a € A”. Then BF @[@], hence AF ( [a] by 
Theorem 3.2. @ was arbitrary and therefore AF V2G. Now let AF Az. 
Then AF (dq for some @ € A”, hence B F 3[G] by Theorem 3.2, and 


consequently BF 37GB. (LJ 


We now formulate a generalization of certain individual often-used ar- 
guments about the invariance of properties under isomorphisms: 


Theorem 3.4 (Invariance theorem). Let A,B be isomorphic structures 
of signature L and let1: A > B be an isomorphism. Then for all y = y(Z) 


AF [a] © BE yfa (ae A”, = (20,...,2an)). 


In particular AF p & BE 9, for all sentences of L. 


Proof. It is convenient to reformulate the claim as 


MEyp & ME (M = (A,w), M' = (B,w'), wi: 2H 22”). 
tM) — 4M’ 


This is easily confirmed by induction on ¢ after first proving 2( 
inductively on t. This proof clearly includes the case y € L°. LY 


Thus, for example, it is once and for all clear that the isomorphic image 
of a group is a group even if we know at first only that it is a groupoid. 
Simply let y in the theorem run through all axioms of group theory. 
Another application: Let 2 be an isomorphism of the group A = (A,°) 
onto the group A’ = (A’,c) and let e and e’ denote their unit elements, 
not named in the signature. We claim that nonetheless 2e = e’, using 
the fact that the unit element of a group is the only solution of ror=a 


(Example 2, page 83). Thus, since A F ece=e, we get A’ F 1e°1e = 1e by 
Theorem 3.4, hence ze = e’. Theorem 3.4, incidentally, holds for formulas 
of higher order as well. For instance, the property of being a continuously 
ordered set (formalizable in a second-order language, see 3.8) is likewise 
invariant under isomorphism. 

£-structures A,B are termed elementarily equivalent if AF as BFEa, 
for alla € £°. One then writes A = B. We consider this important notion 
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in 3.3 and more closely in 5.1. Theorem 3.4 states in particular that 
A~B => A=B. The question immediately arises whether the converse 
of this also holds. For infinite structures the answer is negative (see 3.3), 
for finite structures affirmative; a finite structure of a finite signature can, 
up to isomorphism, even be described by a single sentence. For example, 
the 2-element group ({0,1},-+) is up to isomorphism well determined by 
the following sentence, which tells us precisely how + operates: 


Vo V1 [V0 F V1 AV2(x= V0 Vv L=V}) 


A Vo + V9= V1 + V1 = VO Ao +01 =v} + VO= V4]. 


We now investigate the behavior of the satisfaction relation under sub- 
stitution. The definition of y£ in 2.2 pays no attention to collision of 
variables, which is taken to mean that some variables of the substitution 
term ¢ fall into the scope of quantifiers after the substitution has been 
performed. In this case M F Vay does not necessarily imply MF y£, 
although this might have been expected. In other words, Vry F y£ is 


not unrestrictedly correct. For instance, if gy = dyx #y then certainly 


M EF Vay (= Vedyx # y) whenever M has at least two elements, but 


ME ov ¥ (= Jyy#y) is certainly false. Analogously y £ F Ixy is not 
correct, in general. For example, choose Vy x= y for y and y for t. 


One could forcibly obtain Vry F y £ without any limitation by renam- 
ing bound variables by a suitable modification of the inductive definition 
of y £ in the quantifier step. However, such measures are rather unwieldy 
for the arithmetization of proof method in 6.2. It is therefore preferable 
to put up with minor restrictions when we are formulating rules of deduc- 
tion later. The restrictions we will use are somewhat stronger than they 
need to be but can be handled more easily; they look as follows: 


Call y,£ collision-free if y ¢ bndy for all y € vart distinct from 2. 
We need not require x ¢ bndy because t is substituted only at free oc- 
currences of x in y, that is, x cannot fall after substitution within the 
scope of a prefix Vx, even if « € vart. For collision-free y,£ we always 
get Vry EF y £ by Corollary 3.6 below. 


If o is a global substitution (see 2.2) then , 7 are termed collision-free 
if yp, = are collision-free for every x € Var. If o = 4, this condition clearly 


need be checked only for the pairs y, % with x € varé and x € free. 
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For M = (A,w) put M? := (A,w’) with 2” := (a7) for 2 € Var, 
so that 2" = 27 (= (#7)™). This equation reproduces itself to 


(5) #M@° — 1 for all terms t. 
Tdeed Sef an EO Se FOR ie ee TE for 
t = ft in view of the induction hypothesis ne = t? 
Notice that M° coincides with Men for the case o = £ 
Theorem 3.5 (Substitution theorem). Let M be a model and oa a 
global substitution. Then holds for all py such that y,o are collision-free, 
(6) MEy? = “ Fy. 
In particular, MF pt = Men EF wy, provided y,4 are collision-free. 


Proof by induction on y. In view of (5), we obtain 


MF (t=t2)? @ HM = tM & YW aH & MeFi Ste. 


Prime formulas rf are treated analogously. The induction steps for ~,- 
in the proof of (6) are harmless. Only the V-step is interesting. The 
reader should recall the definition of (Vza)” page 60 and realize that the 
induction hypothesis refers to an arbitrary global substitution rT. 
ME (Vara)? <= ME Var at (x7 =x and y? = y’ else) 

= MF a’ for alla (definition) 
= (M%)" — a for all a a hypothesis) 
<= (M°)SF a for alla ((M%)" = (M7), see below) 

& M°F Vaa. 
We show that (M%)7 = (M7)%. Since Vra,o (hence Va, vr for every y) 
are collision-free, we have x ¢ vary? if y # x, and since y’ = y? we get 
M3)" = yTMt = yoMe — yoM — yM? — (M2 But also 
(Mi)" = gT MG = Mt aga Me, 


in this case y! 
in the case y = x we have x 


Corollary 3.6. For all y and z such that yf are collision-free, the 
following properties hold: 


(a) V2pFE is in particular VrpE yt, (b) y i FS 
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P; 


(c) p£,s=teE yt, provided vy, £,4 are collision-free. 
Proof. Let M F Vzy, so that Mé FE » for all d € A”. In particular, 
Mis FE ». Therefore, M F pt by Theorem 3.5. (b) follows easily from 


Axyp F mp i This holds by (a), for ~A%yp = VF¥-7¢y and =(y ‘) = (-y) e 
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(c): Let ME yp &,s=t, so that s =t™ and M5" F ¢ by the theorem. 
Clearly, then also Min Fy. Hence MF yt. TY 


Remark 2. The identical substitution « is obviously collision-free with every 
formula. Thus, Vzy F vy (= y’) is always the case, while Vzy F y £ is correct 
in general only if t contains at most the variable x, since y, $ are then collision- 
free. Theorem 3.5 and Corollary 3.6 are easily strengthened. Define inductively 
a ternary predicate ‘t is free for x in y’, which intuitively is to mean that no free 
occurrence in y of the variable x lies within the scope of a prefix Vy whenever 
y € vart. In this case Theorem 3.5 holds for ¢o = £ as well, so that nothing 
needs to be changed in the proofs based on this theorem if one works with 
‘t is free for x in y’, or simply reads “y, £ are collision-free” as “t is free for x in 
y.” Though collision-freeness is somewhat cruder and slightly more restrictive, 
it is for all that more easily manageable, which will pay off, for example, in 6.2, 
where proofs will be arithmetized. Once one has become accustomed to the 
required caution, it is allowable not always to state explicitly the restrictions 
caused by collisions of variables, but rather to assume them tacitly. 


Theorem 3.5 also shows that the quantifier “there exists exactly one,” de- 


noted by 2!, is correctly defined by Alay := Ary a VaVy(pay ¥ -r=y) 
with y ¢ vary. Indeed, it is easily seen that MF VaVy(pay 4 -x2=y) 
means just MF yp & Me FEwy% => a=b. Inshort, M“E ¢ for at most 
one a. Putting everything together, M F a!axy iff there is precisely one 
a € Awith M&F vy. An example is M F d!ax=t for arbitrary M and 
x ¢ vart. In other words, J!axz=t is a tautology. Half of this, namely 


F dr 2=t, was shown in Example 1, and F VaVy(a@=tay=t -2x=y) is 


obvious. There are various equivalent definitions of 4!ay. For example, 


a short and catchy formula is drVy(yp % — «=y), where y ¢ vary. The 
equivalence proof is left to the reader. 


Exercises 
1. Let X F y and « ¢ free X. Show that X F Vay. 


2. Prove that Vz(a >) F Vra +Va, which is obviously equivalent 
to F Va(a > B) —Vara —VaZ. 


3. Suppose A’ results from A by adjoining a constant symbol a for 
some a € A. Prove AF ala] & A’ F a(a) (= a $) for a = a(z), 
by first verifying t(a)4" = t(a) 4’. This is easily generalized to the 
case of more than one free variable in a. 
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4. Show that (a) A conjunction of the 4; and their negations is equiva- 


lent to 4, A754, for suitable n,m (An A7d9 = An, 44 A7Idm = 75m). 
(b) A Boolean combination of the 4; is equivalent to V)<, I=, OF 
to dk Vv Vien dak» With kp <--> < ky <k. Note that Vi<, den, 


equals 4—p (= 1) for n=kp=0 and —4, = Ven d=v for n>0. 
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From the perspective of predicate logic a v 7a (a € L) is a trivial example 
of a tautology, because it results by inserting a for p from the propositional 
tautology p v 7p. Every propositional tautology provides generally valid 
£-formulas by the insertion of £-formulas for the propositional variables. 
But there are tautologies not arising in this way. Va(a < ava ¢ 2) is 
an example, though it has still a root in propositional logic. Tautologies 


without a such a root are Jrx=z and dr x=t for x ¢ vart. The former 
arises from the convention that structures are always nonempty, the latter 
from the restriction to totally defined basic operations. A particularly 
interesting tautology is given by the following 


Example 1 (Russell’s antinomy). We will show that the “Russellian 
set” u, consisting of all sets not containing themselves as a member, does 


not exist which clearly follows from F ~duVa(xeu- x¢x). We start with 
Va(zneuc av¢x) Fucus ué¢u. This holds by Corollary 3.6(a). Clearly, 
ueu u¢uis unsatisfiable. Hence, the same holds for Vz(reu @ r¢2x), 


and thus for JuVz(r@euo x¢x). Consequently, F ~duVa(reuo r¢x). 


Note that we need not assume in the above argument that € means 


membership. The proof of F ~duVx(reu — x¢x) need not be related to 
set theory at all. Hence, our example represents rather a logical paradox 
than a set-theoretic antinomy. What looks like an antinomy here is the 


expectation that JuVa(xeu << x¢2x) should hold in set theory if € is to 
mean membership and Cantor’s definition of a set is taken literally. 

The satisfaction clause for a—£ easily yields aF 6 @& Fa-—Q, 
a special case of Xia F 6 & X Fa—QZ. This can be very useful 
in checking whether formulas given in implicative form are tautologies, as 


was mentioned already in 1.3. For instance, from Vza F a £ (which holds 
for collision-free a, £) we immediately get F Vra a4. 
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As in propositional logic, a = 6 is again equivalent to F a « @. 
By inserting £-formulas for the variables of a propositional equivalence 
one automatically procures one of predicate logic. Thus, for instance, 
a—-GB=-7a\v £, because certainly p —-q = 7p v q. Since every £-formula 
results from the insertion of propositionally irreducible £-formulas in a 
formula of propositional logic, one also sees that every £-formula can be 
converted into a conjunctive normal form. But there are also numerous 


other equivalences, for example ~Vra = dara and 7~dra = Vera. The 


first of these means just ~Vra = 7=Var77a (= dr7a), obtained by replac- 
ing a by the equivalent formula —7q@ under the prefix Vz. This is a simple 
application of Theorem 4.1 below with = for ~. 


As in propositional logic, semantic equivalence is an equivalence relation 
in £ and, moreover, a congruence in £L. Speaking more generally, an 
equivalence relation ~ in £ satisfying the congruence property 


CP: ava’, 822 => anBead vf’, naz -a’, Vea = Vara! 


is termed a congruence in £L. Its most important property is expressed by 


Theorem 4.1 (Replacement theorem). Let ~ be a congruence in L 
anda & a’. If vy’ results from yp by replacing the formula a at one or 


more of its occurrences in yp by the formula a’, then yp & y’. 


Proof by induction on y. Suppose ¢ is a prime formula. Both for y= a 
and y 4a, py & vy’ clearly holds. Now let y = yiage. In case yp = a 
holds trivially yp + y’. Otherwise y’ = y Ay5, where vy}, y result from 
1,¥%1 by possible replacements. By the induction hypothesis y, ~ 4 
and y2 ¥ yy. Hence, yp = Yirye & vi Avs = y’ according to CP above. 
The induction steps for =, V follow analogously. [J 


This theorem will constantly be used, mainly with = for ~, without 
actually specifically being cited, just as in the arithmetical rearrangement 
of terms, where the laws of arithmetic used are hardly ever named ex- 
plicitly. The theorem readily implies that CP is provable for all defined 


connectives such as — and 3. For example, a ~ a’ ra = Iza’, 


because a & a’ > Sra = “Vara & -Vexr-a! = dra’. 


First-order languages have a finer structure than those of propositional 
logic. There are consequently further interesting congruences in £. In 
particular, formulas a, 3 are equivalent in an L£L-structure A, in symbols 
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a=, B, if AF al[w] @ AF Blu], for all w. Hence, in A = (N, <,+,0) 
the formulas x < y and 4z(z4#0 02+ 2=y) are equivalent. The proof 


of CP for =, is very simple and is therefore left to the reader. 
Clearly, a =, G is equivalent to AF a << G@. Because of = C =y, 


properties such as ~Vra = dx-7a carry over from = to =,. But there 
are often new interesting equivalences in certain structures. For instance, 
there are structures in which every formula is equivalent to a formula 
without quantifiers, as we will see in 5.6. 

A very important fact with an almost trivial proof is that the intersec- 
tion of a family of congruences is itself a congruence. Consequently, for 
any class K #4 @) of £-structures, =K := (\{=4| A € K} is necessarily a 
congruence. For the class K of all £-structures, =K equals the logical 
equivalence =, which in this section we deal with exclusively. Below we 
list its most important features; these should be committed to memory, 
since they will continually be applied. 


(1) Va(an B) = VranVap, (2) Agr(av B) = Ara v ArB, 


(3) VaVya = VyVra, (4) Agrdya = dydara. 
If x does not occur free in the formula (@, then also 
(5) Valav B)=Vazav p, (6) Ar(anPB) = AranQB, 
(7) VaP = B, (8) Arg = 8, 
(9) Vala +8) = Ara 8B, (10) Aa(a +B) =Vra 7p. 


The simple proofs are left to the reader. (7) and (8) were stated in (2) 
in 2.3. Only (9) and (10) look at first sight surprising. But in practice 
these equivalences are very frequently used. For instance, consider for a 
fixed set of formulas X the evidently true metalogical assertion ‘for all a: 
if X Fa,7a then X F Vax¥x’. This clearly states the same as ‘If there 
is some a such that X Fa,7a then X F Vara’. 


Remark. In everyday speech variables tend to remain unquantified, partly be- 
cause in some cases the same meaning results from quantifying with “there exists 
a” as with “for all.” For instance, consider the following three sentences, which 
obviously tell us the same thing, and of which the last two correspond to the 
logical equivalence (9): 

e If a lawyer finds a loophole in the law it must be changed. 

e If there is a lawyer who finds a loophole in the law it must be changed. 

e For all lawyers: if one of them finds a loophole in the law then it must be 

changed. 
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Often, the type of quantification in linguistic bits of information can be made 
out only from the context, and this leads not all too seldom to unintentional (or 
intentional) misunderstandings. “Logical relations in language are almost always 
just alluded to, left to guesswork, and not actually expressed” (G. Frege). 


Let x,y be distinct variables and a € £. One of the most important 
logical equivalences is renaming of bound variables (in short, bound re- 
naming), stated in 

(11) (a) Vea =Vy(a¥), (b) Fea = Ay(a¥) (y ¢ vara), 
(b) follows from (a) by rearranging equivalently. Note that y ¢ vara is 


equivalent to y ¢ freea and a, ¥ collision-free. Writing M% for Men (a) 
derives as follows: 
MEVaa & MEFa for alla (definition) 
= (Mij)t Fa for alla ne 3.1) 
& (M%)tEa foralla ((M%)% = (M%)2) 
& Mirfak foralla (T Thearen 3.5) 
& MEVy(ag). 
(12) and (13) below are also noteworthy. According to (13), substitu- 
tions are completely described up to logical equivalence by so-called free 


renamings (substitutions of the form #). (13) also embraces the case 


x € vart. In (12) and (13) we tacitly assume that a, £ are collision-free. 
= Idae(r=taa) (x € vart). 
y 


(12) Va(a=t—-a) = 

(13) Vy(y=ta%) Sag =Jy(y=trnak) (y¢ vara,t). 
Proof of (12): Va(r=t—a) F (rx=t-a)£ = t=toak Fak by 
Corollary 3.6. Conversely, let MF a. If M2 — «=t¢t then clearly 


a=t™. Hence also M2 F a, since Mi Fa. Thus, Mi — c=t —a for 
any a € A, ie, M — Va(x=t —a). This proves the left equivalence in 


(12). The right equivalence reduces to the left one because 


da(a@=ta a) = War(r=t aa) = We(r=t — 7a) =ab Had. 


Item (13) is proved similarly. Note that Vy(y=t—-a#)Fati=az 
by Corollary 3.6 and Exercise 4 in 2.2. 

With the above equivalences we can now regain an equivalent formula 
starting with any formula in which all quantifiers are standing at the be- 


ginning. But this result requires both quantifiers V and J, in the following 


denoted by Q,Q;,Q2,... 
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A formula of the form a = Q)21---Qn%,G with an open formula 
is termed a prenex formula or a prenex normal form, in short, a PNF. 
G is called the kernel of a. W.l.o.g. 21,...,2n are distinct and x; occurs 
free in @ since we may drop “superfluous quantifiers,” see (2) page 66. 
Prenex normal forms are very important for classifying definable number- 
theoretic predicates in 6.3, and for other purposes. The already mentioned 


V- and +formulas are the simplest examples. 


Theorem 4.2 (on the prenex normal form). Every formula yp is 
equivalent to a formula in prenex normal form that can effectively be con- 
structed from vp. 


Proof. Without loss of generality let y contain (besides =) at most the 


logical symbols =, ,,V,4. For each prefix Qx in y consider the number of 
symbols - or left parentheses occurring to the left of Qa. Let sy be the 
sum of these numbers, summed over all prefixes occurring in y. Clearly, 
yisa PNF iff sp = 0. Let sp # 0. Then y contains some prefix Qxz and — 
or A stands immediately in front of Qz. A successive application of either 


Wara = dr-a, 7Ara = Vera, or (BAQra) = Qy(Bra ¥) (y ¢ vara, 3), 
inside w obviously reduces sy stepwise. [J 


Example 2. Vrdy(x1#0 > 2-y=1) isa PNF for Va(x40 > dy x-y=1). 
And JrVyVz(ya(y $ aps >my=2z)) for drpaVyV2(y aps > y=2), 
provided y, z ¢ freey; if not, a bound renaming will help. An equivalent 


PNF for this formula with minimal quantifier rank is daVy(y $ @ x=y). 


The formula Va(x40 — dy x-y= 1) from Example 2 may be abbreviated 
by (Vz¥40)4ya+-y=1. More generally, we shall often write (Vz ¥t)a for 
Va(a¥#t >a) and (Ar¥t)a for da(a¥t aa). A similar notation is used 
for <, <, € and their negations. For instance, (V~<t)a and (dr<t)a 


are to mean Va(a<t >a) and dz(x<t a a), respectively. For any binary 
relation symbol <, the “prefixes” (Vy<jz) and (dy<jz) are related to each 


other, as are V and 4, see Exercise 2. 


Exercises 


1. Let a= B. Prove that a a = ge (a, a and £, e collision-free). 


2. Prove that 7=(Vr<dy)a = (dr<y)ra and 7(Sr<dy)a = (Va<y)na. 
Here < represents any binary relation symbol. 
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3. Show by means of bound renaming that both the conjunction and 
the disjunction of V-formulas a, @ is equivalent to some V-formula. 


Prove the same for -formulas. 


4. Show that every formula y € CL is equivalent to some vy’ € L built 


up from literals by means of a, v, V, and i. 


5. Let P be a unary predicate symbol. Prove that Jz(Px —VyPy) is 
a tautology. 


6. Call a, G € L tautologically equivalent if F a = F GB. Confirm that 
the following (in general not logically equivalent) formulas are tau- 
tologically equivalent: a, Vza, and a §, where the constant symbol 
c does not occur in a. 


2.5 Logical Consequence and Theories 


Whenever L’ D CL, the language L’ is called an expansion or extension 
of £ and L a reduct or restriction of L’. Recall the insensitivity of the 
consequence relation to extensions of a first-order language, mentioned in 
2.3. Theorem 3.1 yields that establishing X F a does not depend on the 
language to which the set of formulas X and the formula a belong. For 
this reason, indices for F, such as Fz, are dispensable. 

Because of the unaltered satisfaction conditions for ~ and -, all prop- 
erties of the propositional consequence gained in 1.3 carry over to the 
first-order logical consequence relation. These include general properties 
such as, for example, the reflexivity and transitivity of F, and the seman- 
tic counterparts of the rules (a1), (a2), (41), (42) from 1.4, for instance 

rs 
XFaap 


In addition, Gentzen-style properties such as the deduction theorem 


the counterpart of (1), 


automatically carry over. But there are also completely new properties. 
Some of these will be elevated to basic rules of a logical calculus for first- 
order languages in 3.1, to be found among the following ones: 


>A suggestive way of writing “X F a, implies X F aa,” a notation that was 
introduced already in Exercise 3 in 1.3. A corresponding notation will also be used 


in stating the properties of F on the next page. 
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Some properties of the predicate logical consequence relation. 


(a) oe (a, £ collision-free), 
r 
(b) = _— — (a, £ and a, £ collision-free), 
(c) nase (anterior generalization), 
(d) it (x ¢ free X, posterior generalization), 
(e) eages (x ¢ free X, a, anterior particularization), 
(f) . : = (a, £ collision-free, posterior particularization) 


(a) follows from X - Vrak a £, for F is transitive. Similarly, (b) follows 


from a §,s=tk a £, stated in Corollary 3.6. Analogously (c) results from 
Ya — B. To prove (d), suppose that X F a, MF X, and a ¢ freeX. 
Then M% — X for any a € A by Theorem 3.1, which just means M F Va. 
As regards (e), let X,@F a. Observe that by contraposition and by (d), 


X,GFa X,7AaF AG X,7AaF Va-7£B, 


whence X,7=Vaz—-= F a. (e) captures deduction from an existence claim, 


while (f) confirms an existence claim. (f) holds since a £ - dra according 

to Corollary 3.6. Both (e) and (f) are permanently applied in mathemati- 

cal reasoning and will briefly be discussed in Example 1 on the next page. 

All above properties have certain variants; for example, a variant of (d) is 
XFa$ 

(crane 
X F Vara 

This results from (d) with a ¥ for a and y for x, since Vya ¥ = Vara. 


y ¢ free X U vara). 


From the above properties, complicated chains of deduction can, where 
necessary, be justified step by step. But in practice this makes sense only 
in particular circumstances, because formalized proofs are readable only 
at the expense of a lot of time, just as with lengthy computer programs, 
even with well-prepared documentation. What is most important is that a 
proof, when written down, can be understood and reproduced. This is why 
mathematical deduction tends to proceed informally, i.e., both claims and 
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their proofs are formulated in a mathematical “everyday” language with 
the aid of fragmentary and flexible formalization. To what degree a proof 
is to be formalized depends on the situation and need not be determined 
in advance. In this way the strict syntactic structure of formal proofs is 
slackened, compensating for the imperfection of our brains in regard to 
processing syntactic information. 

Further, certain informal proof methods will often be described by a 
more or less clear reference to so-called background knowledge, and not 
actually carried out. This method has proven itself to be sufficiently 
reliable. As a matter of fact, apart from specific cases it has not yet been 
bettered by any of the existing automatic proof machines. Let us present 
a very simple example of an informal proof in a language £ for natural 
numbers that along with 0,1,+,+ contains the symbol | for divisibility, 


defined by m|n & dkm-k =n. In addition, let £ contain a symbol f for 
some given function from N to N,. We need no closer information on this 
function, but we shall write f; for f(i) in the example. 


Example 1. We want to prove Vn(dx>0)(Vi<n)f; |x, ie., fo,...,f, have 
a common multiple (in N_). The proof proceeds by induction on n. Here 
we focus solely on X, (Sr>0)(Vixn)fila F (dr>0)(Vi<n+1)fila, the in- 
duction step. X represents our prior knowledge about familiar properties 


I 
— 


of divisibility. Informally we reason as follows: Assume (Sx>0)(Vix<n)f; |x 
and choose any such xz. Then «+ f,41 is obviously a common multiple of 
fo,.--,fn41, hence (Sr>0)(Vi<n+1)filvz. To argue here formally like a 


proof machine, we start from (Vi<n)fjla F (Vi<n+1)fil(x + fn4i). Then 
posterior particularization yields X,(Vi<n)fila F (Sx>0)(Vix<nt+1)fila 
since «+ fn+41 > 0, and so, after anterior particularization, we get the de- 
sired X, (dr>0)(Vi<n)f;|a F (Ax>0)(Vixn+1)filz. This example shows 
that formalizing a nearly trivial informal argument may need a lot of 


writing without making things more lucid for mathematicians. 


Some textbooks deal with a somewhat stricter consequence relation, 


which we denote here by The reason is that in mathematics one largely 


considers derivations in theories. For X C £ and y € £ define X 2 yp if 
AF X => AF 4g, for all L-structures A. In contrast to —, which may 
be called the local consequence relation, can be considered as the global 


consequence relation since it cares only about A, not about a concrete 


valuation w in A as does F. 
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Let us collect a few properties of 2 Obviously, X — vy implies X 2 yp, but 
the converse does not hold in general. For example, «= y 2 Veyx=y, 
but z=y - Vryx=y. By (d) from page 79, X F y > X F yw! holds 
in general only if the free variables of y do not occur free in X, while 
xe pa x 2 yp (hence y a ~y’) holds unrestrictedly. A reduction of 2 
to F is provided by the following equivalence, which easily follows from 
MEX! = AEX, for each model M = (A,w): 

(1) XFy o XIE. 

Because of S'% = S' for sets of sentences S, we clearly obtain from (1) 

2) SEp se Sky (SCL). 


In particular, 2 p= Fy. Thus, a distinction between F and is apparent 


only when premises are involved that are not sentences. In this case the 
relation # must be treated with the utmost care. Neither the rule of case 
g g g 
sae | Sala nor the deduction theorem cdl is 
xB X Poa 
unrestrictedly correct. For example «= y 2 Veyx=y, but it is false that 


distinction 


fox y ~Vaeyx=y. This means that the deduction theorem fails to hold 
for the relation #2. It holds only under certain restrictions. 


One of the reasons for our preference of F over is that K extends the 
propositional consequence relation conservatively, so that features such as 
the deduction theorem carry over unrestrictedly, while this is not the case 


for #. It should also be said that ¥ does not reflect the actual procedures of 
natural deduction in which formulas with free variables are frequently used 
also in deductions of sentences from sentences, for instance in Example 1. 


We now make more precise the notion of a formalized theory in £, where 
it is useful to think of the examples in 2.3, such as group theory. Again, 
the definitions by different authors may look somewhat differently. 


Definition. An elementary theory or first-order theory in L, also termed 
an £-theory, is a set of sentences T’ C L° deductively closed in L°, i.e., 
TFEa © aé€T, for alla € £L°. If a € T then we say that a is valid 
or true or holds in T, or a is a theorem of T. The extralogical symbols 
of £ are called the symbols of T. If T C T’ then T is called a subtheory 
of T’, and T’ an extension of T. An L-structure A such that AF T is 
also termed a model of T, briefly a T-model. MdT denotes the class of 
all models of T in this sense; Md T consist of £-structures only. 
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For instance, {a € £L°| X F a} is a theory for any set X C CL, since F 
is transitive. A theory T in L satisfies TF p = AF yg forall AFT, 
where y € £ is any formula. Important is also TF yp = TF vy. These 
readily confirmed facts should be taken in and remembered, since they 
are constantly used. Different authors may use different definitions for a 
theory. For example, they may not demand that theories contain sentences 
only, as we do. Conventions of this type each have their advantages and 
disadvantages. Proofs regarding theories are always adaptable enough to 
accommodate small modifications of the definition. Using the definition 
given above we set the following 
Convention. In talking of the theory S, where S is a set of sentences, we 
always mean the theory determined by S, that is, {a € £L°| SF a}. A set 
X CCL is called an aziom system for T whenever T = {a € L°| X27 Fa}, 
i.e., we tacitly generalize all possibly open formulas in X. We have always 


to think of free variables occurring in axioms as being generalized. 


Thus, axioms of a theory are always sentences. But we conform to stan- 
dard practice of writing long axioms as formulas. We will later consider 
extensive axiom systems (in particular, for arithmetic and set theory) 
whose axioms are partly written as open formulas just for economy. 

There exists a smallest theory in £, namely the set Taut (= Tautz) of all 
generally valid sentences in £, also called the “logical” theory. An axiom 
system for Taut is the empty set of axioms. There is also a largest the- 
ory: the set £L° of all sentences, the inconsistent theory, which possesses 
no models. All remaining theories are called satisfiable or consistent.°® 
Moreover, the intersection T = (),<; 7; of a nonempty family of theories 
T; is in turn a theory: if TF a € £° then clearly T; — a and so a € T; for 
each 7 € I, hence a € T as well. In this book T and J’, with or without 
indices, exclusively denote theories. 

For T C £L° and a € L£° let T + a denote the smallest theory that 
extends T and contains a. Similarly let T+ S for S C L£L° be the smallest 
theory containing TUS. If S$ is finite then 7’ = T+ 5S =T+/S is called 
a finite extension of T. Here /\ S denotes the conjunction of all sentences 
in S. A sentence a is termed compatible or consistent with T if T + a is 


®° Consistent mostly refers to a logic calculus, e.g., the calculus in 3.1. However, it will 
be shown in 3.2 that consistency and satisfiability of a theory coincide, thus justifying 
the word’s ambiguous use. 
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satisfiable, and refutable in T if T +a is satisfiable. Thus, the theory Tp 
of fields is compatible with the sentence 1 + 1=0. Equivalently, 1 + 140 
is refutable in Tp, since the 2-element field satisfies 1 + 1=0. 

If both a and 7a are compatible with T then the sentence a is termed 
independent of T. The classic example is the independence of the parallel 
axiom from the remaining axioms of Euclidean plane geometry, which 
define absolute geometry. Much more difficult is the independence proof 
of the continuum hypothesis from the axioms for set theory. These axioms 
are presented and discussed in 3.4. 


At this point we introduce another important concept; a,3 € L are 


said to be equivalent in or modulo T, a =r GB, if a =, G for all AF T. 
Being an intersection of congruences, =7 is itself a congruence and hence 
satisfies the replacement theorem. This will henceforth be used without 
mention, as will the obvious equivalence of a =7 0, TF a << £, and of 
TF (a< @B)%. A suggestive writing of a =r 2 would also be a=73. 


Example 2. Let Tq be as on p. 65. Claim: ror=x =p, c=e. The only 


tricky proof step is Tg F row=ax2 — x=e. Let xox=-2z and choose some 
y with coy=e. The claim then follows from x=x°e°e=xoxoy=xoy=e. 
A strict formal proof of the latter uses anterior particularization. 


Another important congruence is term equivalence. Call terms s,t 


equivalent modulo (or in) T, in symbols s x7 t, if T — s=t, that is, 
AF s=t[w] for all AF T and w: Var—A. For instance, in T = Te, 
(xoy)-t=y-l 1 1 1 


on 
Another example: in the theory of fields, each term is equivalent to a 


ox! is easily provable, so that (xoy)~! xp yA 
polynomial in several variables with integer coefficients. 

If all axioms of a theory T are V-sentences then T is called a univer- 
sal or V-theory. Examples are partial orders, orders, rings, lattices, and 
Boolean algebras. For such a theory, Md T is closed with respect to sub- 
structures, which means A C BF T > AE T. This follows at once 
from Corollary 3.3. Conversely, a theory closed with respect to substruc- 
tures is necessarily a universal one, as will turn out in 5.4. V-theories are 
further classified. The most important subclasses are equational, quasi- 
equational, and universal Horn theories, all of which will be considered to 


some extent in later chapters. Besides V-theories, the Va-theories (those 


having Va-sentences as axioms) are of particular interest for mathematics. 
More about all these theories will be said in 5.4. 


84 2 First-Order Logic 


Theories are frequently given by structures or classes of structures. The 
elementary theory ThA and the theory Th K of a nonempty class K of 
structures are defined respectively by 


ThA := {ae L|AEa}, ThK :=(\{ThA|A€ K}. 


It is easily seen that ThA and ThK are theories in the precise sense 
defined above. Instead of a € Th K one often writes kK F a. In general, 
Md 7h K is larger than K, as we shall see. 

One easily confirms that the set of formulas breaks up modulo T (more 
precisely, modulo =) into equivalence classes; their totality is denoted 
by BT. Based on these we can define in a natural manner operations 
A,V,7. For instance, @A3 = aa, where ¢ denotes the equivalence 
class to which y belongs. One shows easily that B,,T’ forms a Boolean 
algebra with respect to a,v,7. For every n, the set B,T of all @ in 
BT such that the free variables of y belong to Var, (= {vo,.--,Un—1}) 
is a subalgebra of B,7. Note that BoT is isomorphic to the Boolean 
algebra of all sentences modulo =7, also called the Tarski-Lindenbaum 
algebra of T. The significance of the Boolean algebras B,T is revealed only 
in the somewhat higher reaches of model theory, and they are therefore 
mentioned only incidentally. 


Exercises 


1. Suppose x ¢ free X and c is not in X,a. Prove the equivalence of 


(i) X Fa, (ii) X F Vara, (iii) X Fag. 


This holds then in particular if X is the axiom system of a theory 
or itself a theory. Then x ¢ free X is trivially satisfied. 


2. Let S be a set of sentences, a and (3 formulas, x ¢ free G, and let c 
be a constant not occurring in S, a, 3. Show that 


SEFag ~-68 & SEFAra—8. 


3. Verify for alla,@GeEL° that BET+a S a-GeT¥. 


4. Let T C £ bea theory, Lp C L, and Tp := T NLo. Prove that To is 
also a theory (the so-called reduct theory in the language Lo). 
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2.6 Explicit Definitions—Language Expansions 


The deductive development of a theory, be it given by an axiom system 
or a single structure or classes of those, nearly always goes hand in hand 
with expansions of the language carried out step by step. For example, 
in developing elementary number theory in the language £(0,1,+,-), the 
introduction of the divisibility relation by means of the (explicit) definition 


tly o dza-z=y has certainly advantages not only for purely technical 
reasons. This and similar examples motivate the following 


Definition I. Let r be an n-ary relation symbol not occurring in £. An 
explicit definition of r in £L is to mean a formula of the shape 


tr: TE O(Zz) 
with 6() € £ and distinct variables in 7, called the defining formula. For 


a theory T C ZL, the extension 7). := T + 77 is then called a definitorial 
extension (or expansion) of T by r, more precisely, by 7,. 


T, is a theory in L[r], the language resulting from £ by adjoining the 
symbol r. It will turn out that T;. is a conservative extension of T’, which, 
in the general case, means a theory TJ” D T in L’ D L such that T’NL = T. 
Thus, 7; contains exactly the same £-sentences as does T’. In this sense, 77, 
is a harmless extension of T. Our claim constitutes part of Theorem 6.1. 
For y € L[r] define the reduced formula y" € L as follows: Starting from 
the left, replace every prime formula rt occurring in y by dz(t). Clearly, 


r 


y™ = y, provided r does not appear in ¢. 


Theorem 6.1 (Elimination theorem). Let T, C L[r] be a definitorial 
extension of the theory T C L° by the explicit definition n,. Then for all 
formulas p € Lir] holds the equivalence 


(x) T.F yp & TE. 


For p € L we get in particular T, F p & TF ¢ (since y™ = ~). Hence, 
T, is a conservative extension of T, i.e.,a€T, 3aae€éT, for alla€ L°. 
Proof. Each AF T is expandable to a model A’ — T;. with the same 
domain, setting ra: AF 6 [da] (¢ € A”). Since rf =r, 6(f) for any f, 
we obtain y =r. vy" for all y € L[r] by the replacement theorem. Thus, 
(«) follows from 
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T,F-F peSA'Foforall AFT (MdT,={A'| AE T}) 
& A'E y" for all AFT (because y =r, vy") 
& AF y" for all AFT (Theorem 3.1) 
Pom, LJ 


Operation symbols and constants can be similarly introduced, though 
in this case there are certain conditions to observe. For instance, in T¢ 
(see page 65) the operation ~! is defined by 7: y=a~1 


definition is legitimate, since Tg F Vailyxoy=e; Exercise 3. Only this 


“ xcoy=e. This 


requirement (which by the way is a logical consequence of 7) ensures that 
Tg +1! is a conservative extension of Tg. We therefore extend Definition I 
as follows, keeping in mind that to the end of this section constant symbols 
are to be counted among the operation symbols. 


Definition II. An explicit definition of an n-ary operation symbol f not 
occurring in £ is a formula of the form 

np: y=feio o(#,y) (6€ Land y,21,...,%p distinct). 
ny is called legitimate in T C L if T F Vaealyd, and Ty := T + ng is 
then called a definitorial extension by f, more precisely by nf. In the case 


n = 0 we write c for f and speak of an explicit definition of the constant 
symbol c. Written more suggestively y=c < 6(y). 

Some of the free variables of 6 are often not explicitly named, and thus 
downgraded to parameter variables. More on this will be said in the 
discussion of the axioms for set theory in 3.4. The elimination theorem is 
proved in almost exactly the same way as above, provided 17, is legitimate 
in T. The reduced formula y" is defined correspondingly. For a constant 


c (n = 0 in Definition II), let y’? := Jz(y % 6 Z), where vy 2 denotes the 
result of replacing c in y by z (¢ vary). Now let n > 0. If f does not 
appear in y, set y’ = y. Otherwise, looking at the first occurrence of 
f in yg from the left, we certainly may write y = Yo ft for appropriate 
yo, t, and y ¢ vary. Clearly, y =T, Jy(yo Ay= ft) =r, 1, with 
1 := Ay(yo A Op (t, y)). If f still occurs in y; then repeat this procedure, 


which ends in, say, m steps in a formula y,, that no longer contains f. 
Then put g™@ := gm. 

Frequently, operation symbols f are introduced in more or less strictly 
formalized theories by definitions of the form 


(x) fa:= t(2), 
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where of course f does not occur in the term t(Z). This procedure is in 
fact subsumed by Definition II, because the former is nothing more than 
a definitorial extension of T with the explicit definition 


np: y=feoy=t(aZ). 


This definition is legitimate, since V#i!yy=t(Z) is a tautology. It can 
readily be shown that ng is logically equivalent to V# fZ=t(Z). Hence, 
(*) can indeed be regarded as a kind of an informative abbreviation of a 
legitimate explicit definition with the defining formula y= t(Z). 


Remark 1. Instead of introducing new operation symbols, so-called iota-terms 
from [HB] could be used. For any formula y = y(#,y) in a given language, 
let cyp be a term in which y appears as a variable bound by 1. Whenever 
T F Vealyy, then T is extended by the axiom ViVy[y = eyy(Z, y) — v(2, y)], so 
that cyy(z, y) so to speak stands for the function term f2#, which could have been 
introduced by an explicit definition. We mention that a definitorial language 
expansion is not a necessity. In principle, formulas of the expanded language can 
always be understood as abbreviations in the original language. This is in some 
presentations the actual procedure, though our imagination prefers additional 
notions over long sentences that would arise if we were to stick to a minimal set 
of basic notions. 


Definitions I and II can be unified in a more general declaration. Let 
T, T’ be theories in the languages L, L’, respectively. Then T” is called a 
definitorial extension (or expansion) of T whenever T’ = T + A for some 
list A of explicit definitions of new symbols legitimate in 7’, given in terms 
of those of T (here legitimate refers to operation symbols and constants 
only). A need not be finite, but in most cases it is finite. A reduced 
formula y™ € L is stepwise constructed as above, for every y € L’. 
In this way the somewhat long-winded proof of the following theorem is 
reduced each time to the case of an extension by a single symbol: 


Theorem 6.2 (General elimination theorem). Let T’ be a definitorial 
extension of T. Thena €T' &a% €T. In particular, a€ T' @a€T 
whenever a € L, i.e., T’ is a conservative extension of T. 


A relation or operation symbol s occurring in TC CL is termed explicitly 
definable in T if T contains an explicit definition of s whose defining 
formula belongs to Lo, the language of symbols of T without s. For 
example, in the theory Tg of groups the constant e is explicitly defined 
by =e — xox=2; Example 2 page 83. Another example is presented 
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in Exercise 3. In such a case each model of Tp := TM Lo can be expanded 
in only one way to a T-model. If this special condition is fulfilled then s is 
said to be implicitly definable in T. This could also be stated as follows: if 
T’ is distinct from T only in that the symbol s is everywhere replaced by a 
new symbol s’, then either TUT" F Vz(s@ @ s/%) or TUT’ F Vz(s%#=9'2), 
depending on whether s,s’ are relation or operation symbols. It is highly 


interesting that this kind of definability is already sufficient for the explicit 
definability of sin JT. But we will go without the proof and only quote 
the following theorem. 


Beth’s definability theorem. A relation or operation symbol implicitly 
definable in a theory T is also explicitly definable in T. 


Definitorial expansions of a language should be conscientiously distin- 
guished from expansions of languages that arise from the introduction of 
so-called Skolem functions. These are useful for many purposes and are 
therefore briefly described. 


Skolem normal forms. According to Theorem 4.2, every formula a can 
be converted into an equivalent PNF, a = Q\21---Qpxrza’, where a’ is 


open. Obviously then -a@ = Qy21---Q,2,-70’, where V = 3 and 4 = V. 


Because F qa if and only if sa is unsatisfiable, the decision problem for 
general validity can first of all be reduced to the satisfiability problem 
for formulas in PNF. Using Theorem 6.3 below, the latter—at the cost 
of introducing new operation symbols—is then completely reduced to the 
satisfiability problem for V-formulas. 


Call formulas a and (3 satisfiably equivalent if both are satisfiable (not 
necessarily in the same model), or both are unsatisfiable. We construct 
for every formula, which w.l.o.g. is assumed to be given in prenex form 
a = Q141---Qer,0, a satisfiably equivalent V-formula @ with additional 
operation symbols such that freea@ = freea. The construction of @ will be 


completed after m steps, where m is the number of 4-quantifiers among the 
Q1,..-,Qz. Take @ = ag and a; to be already constructed. If a; is already 


an V-formula let @ = a;. Otherwise a; has the form Vx, ---VapityG; for 
some n > 0. With an n-ary operation symbol f (which is a constant 
in case n=0) not yet used let ajay = VEG; ie Thus, after m steps an 
V-formula @ is obtained such that free@ = freea; this formula @ is called 
a Skolem normal form (SNF) of a. 
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Example 1. If a is the formula Vadya < y then @ is just Vax < fa. 


For a = daeVyx-y=y we have @=Vyc-y=y. 
If a = VaVysz(x < zany <z) then d=Va2Vy(a < fayany < fay). 


Theorem 6.3. Let & be a Skolem normal form for the formula a. Then 


(a) @E a, (b) a is satisfiably equivalent to a. 
Proof. (a): It suffices to show that aj;41 F a; for each of the described 
construction steps. (; se F dyQ; implies aj41 = VER; 2 F Vt sAyB; = ai, 
by (c) and (d) in 2.5. (b): If @ is satisfiable then by (a) so too is a. 
Conversely, suppose A F V#iyG;(#, y, Z) [Z]. For each @ € A” we choose 
some b € A such that A F (6 |a,b,é] (which is possible in view of the 
axiom of choice AC) and expand A to A’ by setting {4d = b for the new 


operation symbol. Then evidently A’ F a;41[é]. Thus, we finally obtain 
a model for @ that expands the initial model. [LJ 


Now, for each a, a tautologically equivalent 4-formula @& is gained as 
well (that is, F a = F &). By the above theorem, we first produce for 
B = 7a a satisfiably equivalent SNF B and put @ := 48. Then indeed 
Fa & Fa, because 


Fa & £ unsatisfiable © B unsatisfiable = F a. 


Example 2. For a := drVy(ry > raz) we have 7a = 2 := Vrdy(rya7ra) 
and 3 = V2(rfxarrx). Thus, & = 73 = 3x(r fx +rx). The last formula 
is a tautology. Indeed, if r4 4 0 then clearly AE J2(rfx >rzx). But the 


same holds if r4 = 0, for then never AF rfx. Thus, & and hence also a 


is a tautology, which is not at all obvious after a first glance at a. This 
shows how useful Skolem normal forms can be for discovering tautologies. 


Remark 2. There are many applications of Skolem normal forms, mainly in 
model theory and in logic programming. For instance, Exercise 5 permits one to 
reduce the satisfiability problem of an arbitrary first-order formula set to a set 
of V-formulas (at the cost of adjoining new function symbols). Moreover, a set 
X of V-formulas is satisfiably equivalent to a set X’ of open formulas as will be 
shown in 4.1, and this problem can be reduced completely to the satisfiability of 
a suitable set of propositional formulas, see also Remark 1 in 4.1. The examples 
of applications of the propositional compactness theorem in 1.5 give a certain 
feeling for how to proceed in this way. 
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Exercises 


1. Suppose that 7; results from a consistent theory T’ by adjoining an 
explicit definition 7 for f and let a” be constructed as explained in 
the text. Show that Ty is a conservative extension of T' if and only 
if 7 is a legitimate explicit definition. 


2. Let S:n++ n+1 denote the successor function in NV = (N,0,8,+,°-). 
Show that ThN is a definitorial extension of Th(N,S,-); in other 
words, 0 and + are explicitly definable by S and - in N. 


1. goy=e is a legitimate explicit definition 


3. Prove that 7: y=a2~ 
in Tg (it suffices to prove Tg F roy=xez—-y=z). Show in ad- 
dition that TE =Tg¢+7. Thus, TE is a definitorial and hence a 
conservative extension of Tg. In this sense, the theories T& and Tg 


are equivalent formulations of the theory of groups. 


4. As is well known, the natural <-relation of N is explicitly definable 
in (N,0,+), for instance, by x < y @ (4z40)z+a2=y. Prove that 
the <-relation of Z is not explicitly definable in (Z,0,+). 


5. Construct to each a € X (C £) an SNF @ such that X is satisfiably 
equivalent to X = {@|a € X} and X E_X, called a Skolemization 
of X. Since we do not suppose that X is countable, the function 


symbols introduced in X must properly be indexed. 


Bibliography 


[AGM] S. ABRAmMSsky, D. M. GaBBay, T. S. E. MAIBAUM (editors), Handbook 


[Ac| 
[Bar| 
[BF] 
[Bel| 
[Be2| 
[Be3| 


[Be4] 


[BM] 


[Ben] 


[BP] 


[BA] 


of Computer Science, I-IV, Oxford Univ. Press, Vol. I, II 1992, Vol. III 
1994, Vol. IV 1995. 


W. ACKERMANN, Die Widerspruchsfretheit der Allgemeinen Mengen- 
lehre, Mathematische Annalen 114 (1937), 305-315. 


J. BARWISE (editor), Handbook of Mathematical Logic, North-Holland 
1977. 


J. BARWISE, S. FEFERMAN (editors), Model-Theoretic Logics, Springer 
1985. 


L. D. BEKLEMISHEV, On the classification of propositional provability 
logics, Math. USSR — Izvestiya 35 (1990), 247-275. 

, Iterated local reflection versus iterated consistency, Ann. Pure 
Appl. Logic 75 (1995), 25-48. 
, Bimodal logics for extensions of arithmetical theories, J. Symb. 
Logic 61 (1996), 91-124. 
, Parameter free induction and reflection, in Computational Logic 
and Proof Theory, Lecture Notes in Computer Science 1289, Springer 
1997, 103-113. 


J. BELL, M. MACHOVER, A Course in Mathematical Logic, North- 
Holland 1977. 


M. Ben-ArI, Mathematical Logic for Computer Science, New York 
1993, 2°¢ ed. Springer 2001. 


P. BENACERRAF, H. PUTNAM (editors), Philosophy of Mathematics, 
Selected Readings, Englewood Cliffs NJ 1964, 2°4 ed. Cambridge Univ. 
Press 1983, reprint 1997. 


A. BeErRARDUCCI, P. D’AQUINO, Ao-complexity of the relation 
y=T[,<, (@), Ann. Pure Appl. Logic 75 (1995), 49-56. 


299 


300 


Bibliography 


[Bil 


[Boo| 
[BJ] 


[BGG] 


[Bue] 
[Bu] 
[Ca] 


[CZ| 
[CK] 


[Ch] 


[CM] 


[Da] 
[Dav] 
[Daw] 


[De| 
[Do] 
[EFT] 


[FF] 


[Fel] 


[Fe2] 
[Fel1] 


G. BIRKHOFF, On the structure of abstract algebras, Proceedings of the 
Cambridge Philosophical Society 31 (1935), 433-454. 


G. BooLos, The Logic of Provability, Cambridge Univ. Press 1993. 


G. Boo.os, R. JEFFREY, Computability and Logic, 3° ed. Cambridge 
Univ. Press 1989. 


E. BORGER, E. GRADEL, Y. GUREVICH, The Classical Decision Prob- 
lem, Springer 1997. 


S. BUECHLER, Essential Stability Theory, Springer 1996. 
S. R. Buss (editor), Handbook of Proof Theory, Elsevier 1998. 


G. CANTOR, Gesammelte Abhandlungen (editor E. ZERMELO), Berlin 
1932, Springer 1980. 


A. CHacrov, M. ZAKHARYASHEV, Modal Logic, Clarendon Press 1997. 


C. C. CHANG, H. J. KEISLER, Model Theory, Amsterdam 1973, 3°¢ ed. 
North-Holland 1990. 


A. CuHuRCH, A note on the Entscheidungsproblem, J. Symb. Logic 1 
(1936), 40-41, also in [Dav, 108-109]. 


W. CLocksIN, C. MELLISH, Programming in PROLOG, 3"¢ ed. Sprin- 
ger 1987. 


D. vAN DALEN, Logic and Structure, Berlin 1980, 4* ed. Springer 2004. 
M. Davis (editor), The Undecidable, Raven Press 1965. 


J. W. Dawson, Logical Dilemmas, The Life and Work of Kurt Godel, 
A. K. Peters 1997. 


O. DEISER, Aziomatische Mengenlehre, Springer, to appear 2010. 
K. DoETs, From Logic to Logic Programming, MIT Press 1994. 


H.-D. EBBINGHAUS, J. FLUM, W. THOMAS, Mathematical Logic, New 
York 1984, 2"¢ ed. Springer 1994. 


A. B. FEFERMAN, S. FEFERMAN, Alfred Tarski, Live and Logic, Cam- 
bridge Univ. Press 2004. 


5S. FEFERMAN, Arithmetization of metamathematics in a general setting, 
Fund. Math. 49 (1960), 35-92. 


____, In the Light of Logic, Oxford Univ. Press 1998. 
W. FELSCHER, Berechenbarkeit, Springer 1993. 


Bibliography 301 


[Fel2] 


[Fi] 
[Fr] 


[Fre] 


[FS] 


[Ga] 


[GJ] 


[Gel 


[Go1 


[GQ] 


[G3] 


[Gor] 


[Gr] 
[HP] 


[Hei] 


[He 


[Her] 


, Lectures on Mathematical Logic, Vol. 1-3, Gordon & Breach 
2000. 


M. FITTING, Incompleteness in the Land of Sets, College Publ. 2007. 


T. FRANZEN, Gédel’s Theorem: An Incomplete Guide to Its Use and 
Abuse, A. K. Peters 2005. 


G. FREGE, Begriffsschrift, eine der arithmetischen nachgebildete For- 
melsprache des reinen Denkens, Halle 1879, G. Olms Verlag 1971, also 
in [Hei, 1-82]. 


H. FRIEDMAN, M. SHEARD, Elementary descent recursion and proof 
theory, Ann. Pure Appl. Logic 71 (1995), 1-47. 


D. GAsBBay, Decidability results in non-classical logic III, Israel Journal 
of Mathematics 10 (1971), 135-146. 


M. Garey, D. JOHNSON, Computers and Intractability, A Guide to the 
Theory of NP-Completeness, Freeman 1979. 


G. GENTZEN, The Collected Papers of Gerhard Gentzen (editor M. E. 
SZABO), North-Holland 1969. 


K. GODEL, Die Vollstdndigkeit der Axiome des logischen Funktionen- 
kalktils, Monatshefte f. Math. u. Physik 37 (1930), 349-360, also in [G63, 
Vol. I, 102-123], [Hei, 582-591]. 

, Uber formal unentscheidbare Sétze der Principia Mathematica 
und verwandter Systeme I, Monatshefte f. Math. u. Physik 38 (1931), 
173-198, also in [G63, Vol. I, 144-195], [Hei, 592-617], [Dav, 4-38]. 

, Collected Works (editor S. FEFERMAN), Vol. I-V, Oxford 
Univ. Press, Vol. I 1986, Vol. II 1990, Vol. III 1995, Vol. IV, V 2003. 


S. N. GORYACHEV, On the interpretability of some extensions of arith- 
metic, Mathematical Notes 40 (1986), 561-572. 


G. GRATZER, Universal Algebra, New York 1968, 2"¢ ed. Springer 1979. 


P. HAJEK, P. PUDLAK, Metamathematics of First-Order Arithmetic, 
Springer 1993. 


J. VAN HEIJENOORT (editor), From Frege to Gédel, Harvard Univ. Press 
1967. 


L. HENKIN, The completeness of the first-order functional calculus, J. 
Symb. Logic 14 (1949), 159-166. 


J. HERBRAND, Recherches sur la théorie de la démonstration, C. R. 
Soc. Sci. Lett. Varsovie, Cl. IIT (1930), also in [Hei, 525-581]. 


302 Bibliography 

[HR] H. Herre, W. RAUTENBERG, Das Basistheorem und einige Anwen- 
dungen in der Modelltheorie, Wiss. Z. Humboldt-Univ., Math. Nat. R. 
19 (1970), 579-583. 

[HeR] B. HERRMANN, W. RAUTENBERG, Finite replacement and finite 
Hilbert-style axiomatizability, Zeitsch. Math. Logik Grundlagen Math. 
38 (1982), 327-344. 

[HA] D. Hi-BerT, W. ACKERMANN, Grundztige der theoretischen Logik, 
Berlin 1928, 6* ed. Springer 1972. 

[HB] OD. Hivperr, P. BERNAys, Grundlagen der Mathematik, I, II, Berlin 
1934, 1939, 2°¢ ed. Springer, Vol. I 1968, Vol. II 1970. 

[Hi] P. HINMAN, Fundamentals of Mathematical Logic, A. K. Peters 2005. 

[Ho] |W. HopceEs, Model Theory, Cambridge Univ. Press 1993. 

[Hor] A. HORN, On sentences which are true of direct unions of algebras, J. 
Symb. Logic 16 (1951), 14-21. 

[Hu] T. W. HuNGERFORD, Algebra, Springer 1980. 

[Id] P. IpziAk, A characterization of finitely decidable congruence modular 
varieties, Trans. Amer. Math. Soc. 349 (1997), 903-934. 

[Ig] K. IGNATIEV, On strong provability predicates and the associated modal 
logics, J. Symb. Logic 58 (1993), 249-290. 

[JK] R. JENSEN, C. KARP, Primitive recursive set functions, in Axiomatic 
Set Theory, Vol. I (editor D. ScoTrT), Proc. Symp. Pure Math. 13, I 
AMS 1971, 143-167. 

[Ka] R. KAYE, Models of Peano Arithmetic, Clarendon Press 1991. 

[Ke] H. J. KEISLER, Logic with the quantifier “there exist uncountably many”, 
Annals of Mathematical Logic 1 (1970), 1-93. 

[Kl1] S. KLEEN, Introduction to Metamathematics, Amsterdam 1952, 2" ed. 
Wolters-Noordhoff 1988. 

[K12] , Mathematical Logic, Wiley & Sons 1967. 

[KR] I. Korec, W. RAUTENBERG, Model interpretability into trees and ap- 
plications, Arch. math. Logik 17 (1976), 97-104. 

[Kr] M. KrRacut, Tools and Techniques in Modal Logic, Elsevier 1999. 

[Kra] J. KRAJIGEK, Bounded Arithmetic, Propositional Logic, and Complexity 


Theory, Cambridge Univ. Press 1995. 


Bibliography 303 


[Kk] 


[Ku] 


[Le] 
[Li] 


[L]] 


[L6] 


[Ms] 


[Ma] 


[Mal] 
[Man] 


[Mar] 
[Mat] 
[MV] 


[Me| 


[Mo] 
[Moo] 


[ML] 


[Po] 


G. KREISEL, J.-L. KRIVINE, Elements of Mathematical Logic, North- 
Holland 1971. 


K. KuNEN, Set Theory, An Introduction to Independence Proofs, North- 
Holland 1980. 


A. Levy, Basic Set Theory, Springer 1979. 


P. LINDSTROM, On eatensions of elementary logic, Theoria 35 (1969), 
1-11. 


J. W. LLoyp, Foundations of Logic Programming, Berlin 1984, 2°¢ ed. 
Springer 1987. 


M. L6B, Solution of a problem of Leon Henkin, J. Symb. Logic 20 
(1955), 115-118. 


A. MACINTYRE, H. SIMMONS, Goédel’s diagonalization technique and 
related properties of theories, Colloquium Mathematicum 28 (1973), 
165-180. 


A. I. MAu’cEv, The Metamathematics of Algebraic Systems, North- 
Holland 1971. 


J. MALITZ, Introduction to Mathematical Logic, Springer 1979. 


Y. 1. MANIN, A Course in Mathematical Logic for Mathematicians, 
New York 1977, 2"¢ ed. Springer 2010. 


D. MARKER, Model Theory, An Introduction, Springer 2002. 
Y. MATIYASEVICH, Hilbert’s Tenth Problem, MIT Press 1993. 


R. MCKENZIE, M. VALERIOTE, The Structure of Decidable Locally 
Finite Varieties, Progress in Mathematics 79, Birkhauser 1989. 


E. MENDELSON, Introduction to Mathematical Logic, Princeton 1964, 
4th ed. Chapman & Hall 1997. 


D. Monk, Mathematical Logic, Springer 1976. 


G. H. Moore, The emergence of first-order logic, in History and Phi- 
losophy of Modern Mathematics (editors W. ASPRAY, P. KITCHER), 
University of Minnesota Press 1988, 95-135. 


G. MULLER, W. LENSKI (editors), The 0Q-Bibliography of Mathematical 
Logic, Springer 1987. 


W. POHLERS, Proof Theory, An Introduction, Lecture Notes in Mathe- 
matics 1407, Springer 1989. 


304 


Bibliography 


[Pz] 
[Pr] 


[RS] 

[Ral] 
[RaQ] 
[Ra3| 
[Rad] 


[Rad] 


[RZ 


[Rol] 


[Ro2] 


[Rob] 


[Rog] 


[Ros] 


[Rot] 
[Ry] 


[Sal 
[Sam] 


B. PorzaT, A Course in Model Theory, Springer 2000. 


M. PRESBURGER, Uber die Vollsténdigkeit eines gewissen Systems der 
Arithmetik ganzer Zahlen, in welchem die Addition als einzige Opera- 
tion hervortritt, Congrés des Mathématiciens des Pays Slaves 1 (1930), 
92-101. 


H. Rasrowa, R. Sikorski, The Mathematics of Metamathematics, 
Warschau 1963, 3'¢ ed. Polish Scientific Publ. 1970. 


W. RAUTENBERG, Klassische und Nichtklassische Aussagenlogik, Vie- 
weg 1979. 


, Modal tableau calculi and interpolation, Journ. Phil. Logic 12 
(1983), 403-423. 


, A note on completeness and maximality in propositional logic, 
Reports on Mathematical Logic 21 (1987), 3-8. 

, Linfiihrung in die mathematische Logik, Wiesbaden 1996, 
3'¢ ed. Vieweg+Teubner 2008. 


, Messen und Zdhlen, Eine einfache Konstruktion der reellen 


Zahlen, Heldermann 2007. 


W. RAUTENBERG, M. ZIEGLER, Recursive inseparability in graph the- 
ory, Notices Amer. Math. Soc. 22 (1975), A-523. 


A. RoBINsoN, Introduction to Model Theory and to the Metamathemat- 
ics of Algebra, Amsterdam 1963, 2°¢ ed. North-Holland 1974. 


, Non-Standard Analysis, Amsterdam 1966, 3 ed. North- 
Holland 1974. 


J. ROBINSON, A machine-oriented logic based on the resolution princi- 
ple, Journal of the ACM 12 (1965), 23-41. 


H. RoGERS, Theory of Recursive Functions and Effective Computability, 
New York 1967, 2"¢ ed. MIT Press 1988. 


J. B. Rosser, Extensions of some theorems of Géddel and Church, J. 
Symb. Logic 1 (1936), 87-91, also in [Dav, 230-235]. 


P. ROTHMALER, Introduction to Model Theory, Gordon & Breach 2000. 


C. RYLL-NARDZEWKI, The role of the axiom of induction in elemenary 
arithmetic, Fund. Math. 39 (1952), 239-263. 


G. SACKS, Saturated Model Theory, W. A. Benjamin 1972. 


G. SAMBIN, An effective fixed point theorem in intuitionistic diagonal- 
izable algebras, Studia Logica 35 (1976), 345-361. 


Bibliography 305 


[Sc] 
[Se] 


[Sh] 


[She] 


[Shoe] 


[Si] 
[Sm] 


[Smo] 
[Smu] 
[So] 


[Sz] 


[Tak] 
[Tal] 


[Ta2] 


[Ta3] 


[Tad] 


[TMR] 


[TV] 


U. SCHONING, Logic for Computer Scientist, Birkhauser 1989. 


A. SELMAN, Completeness of calculi for axiomatically defined classes of 
algebras, Algebra Universalis 2 (1972), 20-32. 


S. SHAPIRO (editor), The Oxford Handbook of Philosophy of Mathemat- 
ics and Logic, Oxford Univ. Press 2005. 


S. SHELAH, Classification Theory and the Number of Nonisomorphic 
Models, Amsterdam 1978, 2°¢ ed. North-Holland 1990. 


J. R. SHOENFIELD, Mathematical Logic, Reading Mass. 1967, A. K. 
Peters 2001. 


W. SieEG, Herbrand analyses, Arch. Math. Logic 30 (1991), 409-441. 


P. SmitH, An Introduction to Gédel’s Theorems, Cambridge Univ. 
Press 2007. 


C. SMORYNSKI, Self-reference and Modal Logic, Springer 1984. 
R. SMULLYAN, Theory of Formal Systems, Princeton Univ. Press 1961. 


R. SOLOVAY, Provability interpretation of modal logic, Israel Journal of 
Mathematics 25 (1976), 287-304. 


W. SZMIELEW, Elementary properties of abelian groups, Fund. Math. 
41 (1954), 203-271. 


G. TAKEUTI, Proof Theory, Amsterdam 1975, 2"? ed. Elsevier 1987. 
A. TARSKI, Der Wahrheitsbegriff in den formalisierten Sprachen, Studia 
Philosophica 1 (1936), 261-405, also in [Ta4, 152-278]. 


, Introduction to Logic and to the Methodology of Deductive Sci- 
ences, Oxford 1941, 3'¢ ed. Oxford Univ. Press 1965 (first edition in 
Polish, 1936). 


, A Decision Method for Elementary Algebra and Geometry, 
Santa Monica 1948, Berkeley 1951, Paris 1967. 


, Logic, Semantics, Metamathematics (editor J. CORCORAN), 
Oxford 1956, 2°¢ ed. Hackett 1983. 


A. Tarski, A. MostTowskI1, R. M. ROBINSON, Undecidable Theories, 
North-Holland 1953. 


A. TARSKI, R. VAUGHT, Arithmetical extensions and relational systems, 
Compositio Mathematica 13 (1957), 81-102. 


306 


Bibliography 


[Tu] 


[Vil| 


[Vi2| 


[Wae] 


[Wag] 
[Wal] 


[Wa2] 
[Wa3] 


[WR| 


[Wij 


[WP] 


[Zi] 


A. TURING, On computable numbers, with an application to the Ent- 
scheidungsproblem, Proc. London Math. Soc., 2"¢ Ser. 42 (1937), 
230-265, also in [Dav, 115-154]. 


A. VISSER, Aspects of Diagonalization and Provability, Dissertation, 
University of Utrecht 1981. 

, An overview of interpretability logic, in Advances in Modal 
Logic, Vol. 1 (editors M. KRACHT et al.), CSLI Lecture Notes 87 (1998), 
307-359. 


B. L. VAN DER WAERDEN, Algebra I, Berlin 1930, 4** ed. Springer 
1955. 


F. WAGNER, Simple Theories, Kluwer Academic Publ. 2000. 


H. WANG, From Mathematics to Philosophy, Routlegde & Kegan Paul 
1974. 

___, Computer, Logic, Philosophy, Kluwer Academic Publ. 1990. 

, A Logical Journey, From Gédel to Philosophy, MIT Press 1997. 


A. WHITEHEAD, B. RUSSELL, Principia Mathematica, I-III, Cambridge 
1910, 1912, 1913, 2"¢ ed. Cambridge Univ. Press, Vol. I 1925, Vol. II, 
IIT 1927. 


A. WILKIE, Model completeness results for expansions of the ordered 
field of real numbers by restricted Pfaffian functions and the exponential 
function, Journal Amer. Math. Soc. 9 (1996), 1051-1094. 


A. WILKIE, J. PARIS, On the scheme of induction for bounded arith- 
metic formulas, Ann. Pure Appl. Logic 35 (1987), 261-302. 


M. ZIEGLER, Model theory of modules, Ann. Pure Appl. Logic 26 (1984), 
149-213. 


Index of Terms and Names 


A 

a.c. (algebraically closed), 48 
V-formula, V-sentence, 68 
V-theory, 83 

Va-sentence, Vi-theory, 190 
abelian group, 47 


divisible, torsion-free, 104 
absorption laws, 48 
Ackermann function, 226 
Ackermann interpretation, 261 
algebra, 42 
algebraic, 48 
almost all, 60, 210 
alphabet, xx, 54 
antisymmetric, 45 
arithmetical, 238 
arithmetical hierarchy, 264 
arithmetization (of syntax), 226 
associative, xxi 
automated theorem proving, 120 
automorphism, 50 
axiom 

of choice, 115 

of continuity, 108 

of extensionality, 113 

of foundation, 115 

of infinity, 115 

of power set, 114 

of replacement, 114 


of union, 113 
axiom system 

logical, 36, 121 

of a theory, 82 
axiomatizable, 104 

finitely, recursively, 104 


B 
-function, 244 
basis theorem 
for formulas, 206 
for sentences, 180 
Behmann, 125 
Birkhoff rules, 127 
Boolean algebra, 49 
atomless, 202 
of sets, 49 
Boolean basis 
for £ in T, 206 
for £L° in T, 180 
Boolean combination, 57 
Boolean function, 2 
dual, self-dual, 15 
linear, 10 
monotonic, 16 
Boolean matrix, 49 
Boolean signature, 5 
bounded, 46 


Brouwer, xvii 
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C for », 123 
Cantor, 111 for G, 286 
cardinal number, 173 for 2 124 


cardinality, 173 

of a structure, 174 
chain, 46 

of structures, 190 

elementary, 191 

of theories, 103 
characteristic, 48 
Chinese remainder theorem, 244 
Church, xviii, 117 
Church’s thesis, 220 
clause, 144, 151 

definite, 144 

positive, negative, 144 
closed under MP, 37 
closure 

deductive, 20 

of a formula, 64 

of a model in JT, 197 
closure axioms, 258 
cofinite, 34 
Cohen, xviii 
coincidence theorem, 66 
collision of variables, 70 
collision-free, 70 
commutative, xxi 
compactness theorem 

first-order, 105 

propositional, 29 
compatible, 82, 254 
complementation, xix 
completeness theorem 

Birkhoff’s, 128 

first-order (Gédel’s), 102 


propositional, 29 
Solovay’s, 288 
completion, 120 
inductive, 194 
composition, xx, 217 
computable, 217 
concatenation, xx 
arithmetical, 224 
congruence, 12, 51 
in £, 74 
congruence class, 51 
conjunction, 2 
connective, 3 
connex, 45 
consequence relation, 20 
local, global, 80 
predicate logical, 64 
propositional, 20 
structural, 20 
consistency extension, 284 
consistent, 20, 26, 82, 96, 158 
constant, xxi 
constant expansion, 97 
constant quantification, 98 
continuity schema, 110 
continuum hypothesis, 174 
contradiction, 17 
contraposition, 21 
converse implication, 4 
coprime, 239 
course-of-values recursion, 224 
cut, 46 
cut rule, 24 
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D 
A-elementary class, 180 
6-function, 219 
Ao-formula, 238 
Ao-induction, 265 
Davis, 256 
decidable, 104, 218 
deduction theorem, 21, 38 
deductively closed, 20, 82 
definable 
Ao-definable, 273 
explicitly, 67, 87 
implicitly, 88 
in a structure, 67 
in a theory, 272 
&1-definable, 273 
with parameters, 108 
DeJongh, 290 
derivability conditions, 270 
derivable, 22, 23, 36, 92 
derivation, 23 
diagram, 170 
elementary, 172 
universal, 192 
direct power, 52 
disjunction, 3 
exclusive, 3 
distributive laws, 49 
divisibility, 239 
domain, xix, 42 
of magnitude, 47 


E 


4formula, 68 


simple, 203 
Ehrenfeucht game, 183 
elementary class, 180 


elementary equivalent, 69 
elementary type, 180 
embedding, 50 
elementary, 176 
trivial, 170 
end extension, 107 
enumerable 
effectively, 117 
recursively, 225 
equation, 57 
Diophantine, 238, 255 
equipotent, 111 
equivalence, 3 
equivalence class, 45 
equivalence relation, 45 
equivalent, 11, 64 
in (or modulo) T, 83 
in a structure, 74 


logically or semantically, 11, 64 


Euclid’s lemma, 244 
existentially closed, 191, 200 
expansion, 45, 78, 87 
explicit definition, 85, 86 
extension, 44, 78, 81 
conservative, 66, 85 
definitorial, 87 
elementary, 171 
finite, 82 
immediate, 197 
of a theory, 81 
transcendental, 179 


F 

f-closed, 44 

factor structure, 51 
falsum, 5 

family (of sets), xix 
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Fermat’s conjecture, 257 
Fibonacci sequence, 224 
fictional argument, 10 
field, 47 
algebraically closed, 48 
of algebraic numbers, 173 
of characteristic 0 or p, 48 
ordered, 48 
real closed, 197 
filter, 34 
finite model property, 125 
finitely generated, 45 
finiteness theorem, 26, 29, 94, 103 
fixed point lemma, 250 
formula, 56 
arithmetical, 238, 272 
Boolean, 5 
closed, 59 
defining, 85 
dual, 15 
first-order, 56 
open (quantifier-free), 57 
prenex, 77 
representing, 9, 237 
universal, 68 
formula algebra, 42 
formula induction, 6, 58 
formula recursion, 8, 58 
four-color theorem, 32 
Frege, 76 
Frege’s formula, 18 
function, xix 
bijective, xix 
characteristic, 218 
identical, xix 
injective, surjective, xix 


partial, 178 
primitive recursive, 218 
recursive (= si-recursive), 218 
function term, 55 
functional complete, 14 


G 
G-frame, 285 
gap, 46 
generalization, 79 
anterior, posterior, 79 
generalized of a formula, 64 
generally valid, 64 
Gentzen calculus, 22 
goal clause, 158 
Godel, xvii, 91, 243, 290 
Gédel number, 223 
of a proof, 229 
of a string, 227 
Godel term, 246 
Gédelizable, 228 
Goldbach’s conjecture, 256 
graph, 46 
k-colorable, 31 
of an operation, xxi 
planar, simple, 31 
ground instance, 138, 158 
ground (or constant) term, 55 
group, groupoid, 47 
ordered, 47 


H 

H-resolution, 149 

Harrington, 282 

Henkin set, 99 

Herbrand model, 103, 138 
minimal, 142 
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Herbrand universe, 138 
Hilbert, xvii 
Hilbert calculus, 35, 121 
Hilbert’s program, xvii, 216 
homomorphism, 50 
canonical, 51 
strong, 50 
homomorphism theorem, 51 
Horn clause, 149 
Horn formula, 140 
basic, universal, 140 
positive, negative, 140 
Horn sentence, 140 
Horn theory, 141 
universal, nontrivial, 141 
hyperexponentiation, 239 


I 
i-term, 87 
I-tuple, xx 


idempotent, xxi 
identity, 127 
identity-free (=-free), 102 
immediate predecessor, 46 
immediate successor, 46 
implication, 4 
incompleteness theorem 

first, 251 

second, 280 
inconsistent, 26, 96 
independent (of T’), 83 
individual variable, 53 
induction 

on y, 8, 58 

on t, 55 
<-induction, 110 
induction axiom, 108 


induction hypothesis, 106 
induction schema, 106 
induction step, 106 
infimum, 48 
infinitesimal, 109 
instance, 137, 158 
integral domain, 48 
(relatively) interpretable, 258 
interpretation, 62 
intersection, x1x 
invariance theorem, 69 
invertible, xxi 
irreflexive, 45 
isomorphism, 50 

partial, 178 


J 
Jeroslow, 290 
jump, 46 


K 
kernel, 51 
of a prenex formula, 77 
Kleene, 218, 264 
Konig’s lemma, 32 
Kreisel, 257, 290 
Kripke frame, 285 
Kripke semantics, 285 


L 
£-formula, 57 
L£-model, 62 


£-structure (=L-structure), 43, 44 


language 
arithmetizable, 228 


first-order (or elementary), 54 


of equations, 127 
second-order, 130 
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lattice, 48 Mostowski, 216, 243, 291 
distributive, 49 
istributive, N 
of sets, 49 


legitimate, 86 
Lindenbaum, 27 
Lindstr6m’s criterion, 201 
literal, 13, 57 

Léb, 290 

L6b’s formula, 285 

logic program, 157 
logical matrix, 49 
logically valid, 17, 64 


M 
p-operation, 218 
bounded, 221 
mapping (see function), xix 
Matiyasevich, 255 
ye-maximal, 40 
maximal element, 46 
maximally consistent, 20, 26, 30, 96 
metainduction, xvi, 236 
metatheory, xvi 
model 
free, 142 
minimal, 150 
of a theory, 81 
predicate logical, 62 
propositional, 8 
transitive, 296 
model companion, 202 
model compatible, 193 
model complete, 194 
model completion, 200 
model interpretable, 261 
modus ponens, 19, 35 
monotonicity rule, 22 


n-tuple, xx 
negation, 2 
neighbor, 31 
nonrepresentability lemma, 251 
nonstandard analysis, 109 
nonstandard model, 107 
nonstandard number, 107 
normal form 
canonical, 14 
disjunctive, conjunctive, 13 
prenex, 77 


Skolem, 88 
O 


w-consistent, 251 
w-incomplete, 253 
w-rule, 291 
w-term, 115 
object language, xv 
operation, xx 
essentially n-ary, 10 
order, 46 
continuous, dense, 46 
discrete, 183 
linear, partial, 46 
ordered pair, 114 
ordinal rank, 296 


P 

p.r. (primitive recursive), 218 
II,-formula, 238 

pair set, 114 

pairing function, 222 
parameter definable, 108 
parenthesis economy, 6, 57 
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Paris, 282 

partial order, 46 
irreflexive, reflexive, 46 

particularization, 79 
anterior, posterior, 79 

Peano arithmetic, 106 

Peirce’s formula, 18 

persistent, 189 

Polish (prefix) notation, 7 

(monic) polynomial, 105 

poset, 46 

power set, xix 

predecessor function, 134 

predicate, xx 
arithmetical, 238 
Diophantine, 238 
primitive recursive, 218 
recursive, 218 


recursively enumerable, 225 


preference order, 296 
prefix, 57 
premise, 22 
preorder, 46 
Presburger, 204 
prime field, 48 
prime formula, 4, 57 
prime model, 171 
elementary, 171 
prime term, 55 
primitive recursion, 218 
principle of bivalence, 2 
principle of extensionality, 2 
product 
direct, 52 
reduced, 210 
programming language, 132 


projection, 52 

projection function, 218 

PROLOG, 157 

(formal) proof, 36, 122 

propositional variable, 4 
modalized, 289 

provability logic, 285, 288 

provable, 22, 23, 36 

provably recursive, 273 

Putnam, 256 


Q 


quantification, 56 
bounded, 221, 238 

quantifier, 41 

quantifier compression, 243 

quantifier elimination, 202 

quantifier rank, 58 


quasi-identity, quasi-variety, 128 


query, 157 
quotient field, 188 


R 


r.e. (recursively enumerable), 225 


Rabin, 258 

range, X1x 

rank (of a formula), 8, 58 
recursion equations, 218 
recursive definition, 8 
reduced formula, 85, 86 
reduct, 45, 78 

reduct theory, 84 
reductio ad absurdum, 23 
reflection principle, 283, 294 
reflexive, 45 

refutable, 83 

relation, xix 
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relativised of a formula, 258 provable, 278 
renaming, 76, 153 »-formula, 238 
bound, free, 76 special, 267 
replacement theorem, 12, 74 Sambin, 290 
representability satisfiability relation, 17, 62 
of Boolean functions, 9 satisfiable, 17, 64, 82, 144 
of functions, 241 satisfiably equivalent, 88 
of predicates, 237 scope (of a prefix), 58 
representability theorem, 246 segment, xx 
(successful) resolution, 146 initial, xx, 46 
resolution calculus, 145 terminal, xx 
resolution closure, 145 semigroup, 47 
resolution rule, 145 free, 47 
resolution theorem, 148, 151, 167 ordered,regular, 47 
resolution tree, 146 semilattice, 48 
resolvent, 145 semiring, 48 
restriction, 44 ordered, 48 
ring, 47 sentence, 59 


ordered, 48 
Abraham Robinson, 109 
Julia Robinson, 256 


separator, 156 
sequence, Xx 


: ; ; sequent, 22 
Robinson’s arithmetic, 234 mice. 
Rogers, 290 
set 
a a countable, uncountable, 111 
basic, 22, 92 


densely ordered, 46 
discretely ordered, 183 
finite, 111 

ordered, 46 

partially ordered, 46 


derivable (provable), 23 
Gentzen-style, 25 
Hilbert-style, 121 
of Horn resolution, 149 
sound, 25, 93 
rule induction, 25, 94 
Russell, xvii 


transitive, 296 
well-ordered, 46 


Russell’s antinomy, 73 DR CHeE ANCHO 


signature 
S algebraic, 57 
S-invariant, 186 extralogical, 43 


»11-completeness, 239 logical, 4 
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signum function, 220 
singleton, 152 
Skolem, xvii 

Skolem function, 88 
Skolem’s paradox, 116 
Skolemization, 90 
SLD-resolution, 162 
solution, 158 
soundness, 26, 94 
SP-invariant, 188 


Stone’s representation theorem, 49 


string, xx 
atomic, xx 
structure, 42 
algebraic, relational, 42 
subformula, 7, 58 
substitution, 59 
global, 60 
identical, 60 
propositional, 19 
simple, 59 
simultaneous, 59 
substitution function, 248 
substitution invariance, 127 
substitution theorem, 71 
substring, xx 
substructure, 44 
(finitely) generated, 45 
elementary, 171 
substructure complete, 207 
subterm, 56 
subtheory, 81 
successor function, 105 
supremum, 49 
symbol, xx 
extralogical, logical, 54 


of T, 81 


symmetric, 45 


system (of sets), xix 


© 


T-model, 82 

Tarski, 20, 169, 216 

Tarski fragment, 260 
Tarski-Lindenbaum algebra, 84 


tautologically equivalent, 78 
tautology, 17, 64 
term equivalent, 15 


term function, 67 


term induction, 55 
term model, 100, 136 
term, term algebra, 55 


tertium non datur, 17 


theorem 


Artin’s, 197 

Cantor’s, 111 
Cantor—Bernstein, 174 
Dzhaparidze’s, 293 
Goodstein’s, 282 
Goryachev’s, 295 
Herbrand’s, 139 
Lagrange’s, 255 
Lindenbaum’s, 27 
Lindstr6ém’s, 129 
Léb’s, 282 

Los’s, 211 
Léwenheim—Skolem, 112, 175 
Morley’s, 179 
Rosser’s, 252 

Shelah’s, 211 
Steinitz’s, 197 
Trachtenbrot’s, 125 
Visser’s, 289 
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theory, 81 
(finitely) axiomatizable, 104 
arithmetizable, 250 
complete, 105 
consistent (satisfiable), 82 
countable, 111 
decidable, 119, 228 
elementary or first-order, 81 
equational, 127 
essentially undecidable, 257 
hereditarily undecidable, 254 
inconsistent, 82 
inductive, 191 
k-categorical, 176 
quasi-equational, 128 
strongly undecidable, 254 
undecidable, 119 
universal, 83 

transcendental, 48 

transitive, 45 

(directed) tree, 32 

true, 253 

truth function, 2 

truth functor, 3 

truth table, 2 

truth value, 2 

Turing machine, 220 


U 
U-resolution, 162 
U-resolvent, 161 
UH-resolution, 162 
ultrafilter, 34 
nontrivial, 34 
ultrafilter theorem, 35 
ultrapower, 211 
ultraproduct, 211 


undecidable, 104, 119 
unifiable, 152 
unification algorithm, 153 
unifier, 152 
generic, 153 
union, xix 
unique formula reconstruction, 7 
unique formula reconstruction prop- 
erty, 58 
unique term concatenation, 55 
unique term reconstruction, 55 
unit element, 47 
universal closure, 64 
universal part, 187 
universe, 114 
urelement, 112 


Vv 
valuation, 8, 62, 285 
value matrix, 2 
variable, 54 

free, bound, 58 
variety, 127 
Vaught, 179 
verum, 5 


W 

w.l.o.g., Xxi 

word (over A), xx 
word semigroup, 47 


Z 

Z-group, 205 
zero-divisor, 48 
Zorn’s lemma, 46 
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64 
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(A, @) Fy 67 PA 106 Re 145 
AF y [dj 67 Is 106 HR, 149 
yA 67 n (=S"0) 107 ?, N 149 
An, Jen 68 IA 108 HR(P, N) 149 
yh 69 M~N 111 Vo, wp, p, 150 
A=B 69 ZFC, ZF 112 K°? 152 
Me 71 AE, AS, AU 113 Pie 157 
A! 72 {zex|yp} 113 GI(K) 158 
=A, =k 75 AP, AR 114 sum 159 
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